TITLE

NUCLEOTIDE SEQUENCES OF 
SOYBEAN ACYL-ACP THIOESTERASE GENES 
CROSS-REFERENCE TO RELATED APPLICATION

This application is a continuation-in-part of 
application U.S. Serial No. 07/631,264, filed
December 20, 1990. 

FIELD OF THE INVENTION 

The invention relates to isolated nucleic acid
fragments that encode plant seed acyl-ACP thioesterase
enzymes or their precursors. Such fragments are useful in
a method to alter plant oil composition.

BACKGROUND OF THE INVENTION 

Soybean is the lowest-cost source of vegetable oil. 
Soybean oil accounts for about 70% of the 14 billion 
pounds of edible oil consumed in the United States and 
is a major edible oil worldwide. It is used in baking, 
frying, salad dressing, margarine, and a multitude of 
processed foods. Soybean is agronomically well-adapted 
to many parts of the U.S. In 1987/88 sixty million 
acres of soybean were planted in the U.S. Soybean 
products are also a major element of foreign trade since 
thirty million metric tons of soybeans, twenty-five 
million metric tons of soybean meal, and one billion 
pounds of soybean oil were exported in 1987/88. 
Nevertheless, increased foreign competition has led to
recent declines in soybean acreage and production in the
U.S. The low cost and ready availability of soybean oil
provide an excellent opportunity to upgrade this
commodity oil into higher value specialty oils that add
value to the soybean crop for the U.S. farmer and enhance
U.S. trade.

The specific performance and health attributes of
edible oils are determined largely by their fatty acid
composition. Soybean oil derived from commercial
varieties is composed primarily of 11% palmitic (16:0),
4% stearic (18:0), 24% oleic (18:1), 54% linoleic (18:2)
and 7% linolenic (18:3) acids. Palmitic and stearic
acids are, respectively, 16- and 18-carbon-long,
saturated fatty acids. Oleic, linoleic and linolenic
are 18-carbon-long, unsaturated fatty acids containing
one, two and three double bonds, respectively. Oleic
acid is also referred to as a "monounsaturated" fatty
acid, while linoleic and linolenic acids are also
referred to as "polyunsaturated" fatty acids. The
specific performance and health attributes of edible
oils are determined largely by their fatty acid
composition.
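
The composition figures above can be grouped into the saturated, monounsaturated and polyunsaturated classes with a few lines of arithmetic. The following is a minimal sketch in Python. The percentages and the carbon:double-bond notation are taken from the text, while the dictionary layout and function names are illustrative assumptions only.

```python
# Fatty acid composition of commercial soybean oil as quoted in the text:
# name -> (carbon atoms, double bonds, percent of total fatty acids).
SOYBEAN_OIL = {
    "palmitic": (16, 0, 11.0),
    "stearic": (18, 0, 4.0),
    "oleic": (18, 1, 24.0),
    "linoleic": (18, 2, 54.0),
    "linolenic": (18, 3, 7.0),
}


def saturation_class(double_bonds):
    """Classify a fatty acid by its number of double bonds."""
    if double_bonds == 0:
        return "saturated"
    if double_bonds == 1:
        return "monounsaturated"
    return "polyunsaturated"


def class_totals(composition):
    """Sum the percentages of each saturation class (hypothetical helper)."""
    totals = {"saturated": 0.0, "monounsaturated": 0.0, "polyunsaturated": 0.0}
    for _carbons, double_bonds, percent in composition.values():
        totals[saturation_class(double_bonds)] += percent
    return totals


if __name__ == "__main__":
    for name, value in class_totals(SOYBEAN_OIL).items():
        print(f"{name}: {value:.0f}%")
```

With the quoted figures, the saturates sum to 15%, the monounsaturate to 24% and the polyunsaturates to 61% of total fatty acids.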

Soybean oil is high in saturated fatty acids when
compared to other sources of vegetable oil and contains
a low proportion of oleic acid relative to the total
fatty acid content of the soybean seed. These
characteristics do not meet important health needs as
defined by the American Heart Association.

Recent research efforts have examined the role that
monounsaturated fatty acids play in reducing the risk of
coronary heart disease. In the past, it was believed
that monounsaturates, in contrast to saturates and
polyunsaturates, had no effect on serum cholesterol and
coronary heart disease risk. Several recent human
clinical studies suggest that diets high in
monounsaturated fat may reduce the "bad" (low-density
lipoprotein) cholesterol while maintaining the "good"
(high-density lipoprotein) cholesterol. (See Mattson,
et al., Journal of Lipid Research (1985) 26:194-202).
The significance of monounsaturated fat in the diet was
confirmed by international researchers from seven
countries at the Second Colloquium on Monounsaturated
Fats sponsored by the National Heart, Lung and Blood
Institute in 1987.

Soybean oil is also relatively high in
polyunsaturated fatty acids, at levels far in excess
of essential dietary requirements. These fatty acids
oxidize readily to give off-flavors and reduce the
performance of unprocessed soybean oil. The stability
and flavor of soybean oil are improved by hydrogenation,
which chemically reduces the double bonds. However,
this processing reduces the economic attractiveness of
soybean oil.

A soybean oil low in total saturates and 
polyunsaturates and high in monounsaturate would provide 
significant health benefits to human consumers as well 
as economic benefit to oil processors. Such soybean 
varieties will also produce valuable meal for use as 
animal feed. 

Another type of differentiated soybean oil is an 
edible fat for confectionary uses. More than two 
billion pounds of cocoa butter, the most expensive 
edible oil, are produced worldwide. The U.S. imports 
several hundred million dollars worth of cocoa butter 
annually. The high and volatile prices and uncertain 
supply of cocoa butter have encouraged the development 
of cocoa butter substitutes. The fatty acid composition 
of cocoa butter is 26% palmitic, 34% stearic, 35% oleic 
and 3% linoleic acids. Cocoa butter's unique fatty acid 
composition and distribution on the triglyceride 
molecule confer on it properties eminently suitable for 
confectionary end-uses: it is brittle below 27°C and,
depending on its crystalline state, melts sharply at
25-30°C or 35-36°C. Consequently, it is hard and non-
greasy at ordinary temperatures and melts very sharply
in the mouth. It is also extremely resistant to
rancidity. For these reasons, a soybean oil with
increased levels of palmitic and stearic acids,
especially in soybean lines containing reduced levels of
unsaturated fatty acids, is expected to provide a cocoa
butter substitute in soybean. This will add value to
oil and food processors as well as reduce the foreign
import of certain tropical oils.

The partial purification of acyl-ACP thioesterase
was reported from safflower seeds (McKeon et al., (1982)
J. Biol. Chem. 257:12141-12147). However, this
purification scheme was not useful for soybean, either
because the thioesterases are different or because of
the presence of other proteins such as the soybean seed
storage proteins in seed extracts.

SUMMARY OF THE INVENTION

A method to alter the levels of saturated and
unsaturated fatty acids in edible plant oils has been
invented. Isolated soybean seed acyl-ACP thioesterase
cDNAs for either the precursor or enzyme were used to
create chimeric genes. Transformation of plants with
the chimeric genes alters the fatty acid composition of
the seed oil.

The invention is nucleic acid fragments comprising
a nucleotide sequence encoding a plant acyl-ACP
thioesterase. More specifically, the fragment may be
isolated from soybean, oil producing Brassica species,
Cuphea viscosissima or Cuphea lanceolata. One fragment
of the invention corresponds to nucleotides 1 to 1602 of
SEQ ID NO:1, or any nucleic acid fragment substantially
homologous therewith. Another fragment corresponds to
nucleotides 1 to 1476 of SEQ ID NO:3, or any nucleic
acid fragment substantially homologous therewith. More
preferred nucleic acid fragments are nucleotides 106 to
1206 of SEQ ID NO:1 and nucleotides 117 to 1217 of SEQ
ID NO:3, or any nucleic acid fragment substantially
homologous therewith, for soybean seed acyl-ACP
thioesterase precursor. Also more preferred nucleic
acid fragments are nucleotides 271 to 1206 of SEQ ID
NO:1 and nucleotides 282 to 1217 of SEQ ID NO:3, or any
nucleic acid fragment substantially homologous therewith,
for mature soybean seed acyl-ACP thioesterase.
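
The nucleotide ranges recited above can be checked for internal consistency. The sketch below, in Python, treats the coordinates as 1-based and inclusive (an assumption, though it is the usual convention for sequence listings) and computes the length of each region in nucleotides and codons. The dictionary layout and function names are hypothetical, and the section does not state whether the final codon of each range is a stop codon, so codon counts should not be read as exact amino acid counts.

```python
# Coordinates quoted in the text, assumed 1-based and inclusive.
REGIONS = {
    "SEQ ID NO:1": {"cdna": (1, 1602), "precursor": (106, 1206), "mature": (271, 1206)},
    "SEQ ID NO:3": {"cdna": (1, 1476), "precursor": (117, 1217), "mature": (282, 1217)},
}


def span_length(start, end):
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1


def extract(sequence, start, end):
    """Slice a 1-based, inclusive range out of a Python string."""
    return sequence[start - 1:end]


def summarize(region):
    """Report region sizes and whether both ranges are whole numbers of codons."""
    precursor_nt = span_length(*region["precursor"])
    mature_nt = span_length(*region["mature"])
    return {
        "precursor_nt": precursor_nt,
        "mature_nt": mature_nt,
        "precursor_codons": precursor_nt // 3,
        "mature_codons": mature_nt // 3,
        # Codons present in the precursor range but not the mature range.
        "extension_codons": (precursor_nt - mature_nt) // 3,
        "in_frame": precursor_nt % 3 == 0 and mature_nt % 3 == 0,
    }


if __name__ == "__main__":
    for name, region in REGIONS.items():
        print(name, summarize(region))
```

Under these assumptions both cDNAs give the same figures: a precursor range of 1101 nucleotides, a mature range of 936 nucleotides, and a difference of 165 nucleotides (55 codons) at the 5' end of the precursor range.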

Another aspect of this invention is a chimeric gene
capable of transforming a plant cell comprising a
nucleic acid fragment encoding soybean seed acyl-ACP
thioesterase cDNA operably linked to suitable regulatory
sequences such that expression of the gene causes
altered levels of acyl-ACP thioesterase in the seed.
Preferred are those chimeric genes which incorporate
nucleic acid fragments encoding soybean seed acyl-ACP
thioesterase precursor or mature soybean seed acyl-ACP
thioesterase enzyme.

A further aspect of this invention is a plant 
transformed with the chimeric genes described above.

Yet another embodiment of the invention is a method
to produce seed oil containing altered levels of
saturated and unsaturated fatty acids comprising:
(a) transforming a plant cell with a chimeric gene
described above, (b) growing sexually mature plants from
the transformed plant cells, (c) screening progeny seeds
from the sexually mature plants of step (b) for the
desired levels of palmitic and stearic acid, and (d)
processing the progeny seed to obtain oil containing
altered levels of palmitic and stearic acid. Preferred
plant cells and oils are soybean, oil producing Brassica
species, sunflower, cotton, cocoa, peanut, safflower,
and corn.

The invention is also embodied in a method of RFLP
breeding to obtain altered levels of palmitic, stearic,
and oleic acids in seed oil. This method comprises:
(a) making a cross between two soybean varieties
differing in the trait, (b) making a Southern blot of
restriction enzyme-digested genomic DNA isolated from
several progeny plants resulting from the cross of step
(a); and (c) hybridizing the Southern blot with the
radiolabeled nucleic acid fragments described herein.
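
The principle behind the RFLP method can be illustrated with a toy digest. The sketch below, in Python, cuts two made-up allele sequences at EcoRI recognition sites (GAATTC, cut after the G) and compares the resulting fragment lengths. The allele sequences, the choice of EcoRI and the function name are illustrative assumptions and are not taken from this disclosure. In a real Southern blot only the fragments hybridizing to the labeled probe would be detected.

```python
ECORI_SITE = "GAATTC"  # EcoRI recognition sequence; the enzyme cuts G^AATTC


def digest(sequence, site=ECORI_SITE, cut_offset=1):
    """Return fragment lengths after cutting a linear sequence at every site."""
    cut_positions = []
    position = sequence.find(site)
    while position != -1:
        cut_positions.append(position + cut_offset)
        position = sequence.find(site, position + 1)
    bounds = [0] + cut_positions + [len(sequence)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


# Two hypothetical alleles of the same 50-base locus. Allele B carries a single
# base change (GAATTC -> GAATAC) that destroys the second restriction site.
ALLELE_A = "A" * 10 + "GAATTC" + "C" * 20 + "GAATTC" + "T" * 8
ALLELE_B = "A" * 10 + "GAATTC" + "C" * 20 + "GAATAC" + "T" * 8

if __name__ == "__main__":
    print("allele A fragments:", digest(ALLELE_A))
    print("allele B fragments:", digest(ALLELE_B))
```

Loss of one site merges two fragments into a longer one, and this length difference is the polymorphism that can be followed through the progeny of a cross.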

BRIEF DESCRIPTION OF THE SEQUENCE DESCRIPTIONS

The invention can be more fully understood from the 
following detailed description and the Sequence 
Descriptions which form a part of this application. The 
Sequence Descriptions contain the three letter codes for 
amino acids as defined in 37 C.F.R. 1.822 which are 
incorporated herein by reference. The nucleotide 
sequences read from 5' to 3'. 

SEQ ID NO:1 shows the 1602 nucleotides of a soybean
seed acyl-ACP thioesterase cDNA.

SEQ ID NO:2 shows the amino acid sequence of the
precursor protein of a soybean seed acyl-ACP
thioesterase (the coding sequence of SEQ ID NO:1).

SEQ ID NO:3 shows the 1476 nucleotides of a soybean
seed acyl-ACP thioesterase cDNA.

SEQ ID NO:4 shows the amino acid sequence of the
precursor protein of a soybean seed acyl-ACP
thioesterase (the coding sequence of SEQ ID NO:3).

SEQ ID NOs:5 and 6 show sequences related to the N-
terminal sequence of acyl-ACP thioesterase.

SEQ ID NOs:7, 8 and 9 show respectively a protein
sequence, DNA sequence and the related hybridization
probe.

SEQ ID NOs:10, 11 and 12 show respectively a
protein sequence, DNA sequence and the related
hybridization probe.

SEQ ID NO:13 shows the sequence of the sequencing
primer used to identify soybean acyl-ACP thioesterase
isozymes.

SEQ ID NOs:14, 15, 16 and 17 show sequences chosen
from SEQ ID NO:1 as probes for identification of
acyl-ACP thioesterase genes from the C. viscosissima
and C. lanceolata genomes.

SEQ ID NO:18 shows a PCR primer corresponding to
bases 83 through 117 in SEQ ID NO:1.

SEQ ID NO:19 shows a PCR primer corresponding to
bases 274 through 296 in SEQ ID NO:1.

SEQ ID NO:20 shows a 1378 base pair, partial
genomic clone of acyl-ACP thioesterase from B. napus.

SEQ ID NO:21 shows an 865 base pair insert
sequenced from C. viscosissima.

SEQ ID NO:22 shows an 852 base pair insert
sequenced from C. lanceolata.

DETAILED DESCRIPTION OF THE INVENTION 

The present invention describes two isolated
nucleic acid fragments that encode soybean seed acyl-ACP
thioesterases. These enzymes catalyze the hydrolytic
cleavage of palmitic acid, stearic acid and oleic acid
from ACP in the respective acyl-ACPs.

Only recently have serious efforts been made to
improve the quality of soybean oil through plant
breeding, especially mutagenesis. A wide range of
fatty acid compositions has been discovered in
experimental lines of soybean (Table 1). Findings from
work on various oil crops suggest that the fatty acid
composition of soybean oil can be significantly altered
without affecting the agronomic performance of a soybean
plant. However, there is no soybean mutant line with
levels of saturates less than those present in
commercial canola, the major competitor to soybean oil
as a "healthy" oil.

TABLE 1

Range of Fatty Acid Percentages
Produced by Soybean Mutants

Fatty Acids        Range of %

Palmitic Acid      6-28
Stearic Acid       3-30
Oleic Acid         17-50
Linoleic Acid      35-60
Linolenic Acid     3-12



There are serious drawbacks to using mutagenesis to
alter fatty acid composition. One is unlikely to
discover mutations a) that result in a dominant ("gain-
of-function") phenotype, b) in genes that are essential
for plant growth, and c) in an enzyme that is not rate-
limiting and that is encoded by more than one gene.
Even when some of the desired mutations are available in
soybean mutant lines, their introgression into elite
lines by traditional breeding techniques will be slow
and expensive, since the desired oil compositions in
soybean are most likely to involve several recessive
genes.

Recent molecular and cellular biology techniques
offer the potential for overcoming some of the
limitations of the mutagenesis approach, including the
need for extensive breeding. Particularly useful
technologies are: a) seed-specific expression of foreign
genes in transgenic plants (see Goldberg et al., (1989)
Cell 56:149-160), b) use of antisense RNA to inhibit
plant target genes in a dominant and tissue-specific
manner (see van der Krol et al., (1988) Gene 72:45-50),
c) use of homologous transgenes to suppress native gene
expression (see Napoli et al., (1990) The Plant Cell
2:279-289; van der Krol et al., (1990) The Plant Cell
2:291-299; Smith et al., (1990) Mol. Gen. Genetics
224:477-481), d) transfer of foreign genes into elite
commercial varieties of commercial oilcrops, such as
soybean (Chee et al. (1989) Plant Physiol. 91:1212-1218;
Christou et al., (1989) Proc. Natl. Acad. Sci. U.S.A.
86:7500-7504; Hinchee et al., (1988) Bio/Technology
6:915-922; EPO publication 0 301 749 A2), rapeseed (De
Block et al., (1989) Plant Physiol. 91:694-701), and
sunflower (Everett et al., (1987) Bio/Technology
5:1201-1204), and e) use of genes as restriction
fragment length polymorphism (RFLP) markers in a
breeding program, which makes introgression of recessive
traits into elite lines rapid and less expensive
(Tanksley et al. (1989) Bio/Technology 7:257-264).
However, each of these technologies requires
identification and isolation of commercially-important
genes.

Oil biosynthesis in plants has been fairly well-
studied (see Harwood (1989) in Critical Reviews in Plant
Sciences, Vol. 8(1):1-43). The biosynthesis of
palmitic, stearic and oleic acids occurs in the plastids
of plant cells by the interplay of three key enzymes of
the "ACP track": palmitoyl-ACP elongase, stearoyl-ACP
desaturase and acyl-ACP thioesterase.

Of these three enzymes, acyl-ACP thioesterase
removes the acyl chain from the carrier protein (ACP)
and thus from the metabolic pathway. The same enzyme,
with slightly differing efficiency, catalyzes the
hydrolysis of the palmitoyl, stearoyl and oleoyl-ACP
thioesters. This multiple activity leads to substrate
competition between enzymes and it is the competition of
acyl-ACP thioesterase and palmitoyl-ACP elongase for the
same substrate and of acyl-ACP thioesterase and
stearoyl-ACP desaturase for the same substrate that
leads to the production of a particular ratio of
palmitic, stearic and oleic acids.

Once removed from the ACP track by the action of 
acyl-ACP thioesterase, fatty acids are exported to the 
cytoplasm and there used to synthesize acyl-coenzyme A 
(CoA). These acyl-CoAs are the acyl donors for at
least three different glycerol acylating enzymes
(glycerol-3-P acyltransferase, 1-acyl-glycerol-3-P
acyltransferase and diacylglycerol acyltransferase) 
which incorporate the acyl moieties into 
triacylglycerides during oil biosynthesis. 

These acyltransferases show a strong, but not
absolute, preference for incorporating saturated fatty 
acids at positions 1 and 3 and monounsaturated fatty 
acid at position 2 of the triglyceride. Thus, altering 
the fatty acid composition of the acyl pool will drive 
by mass action a corresponding change in the fatty acid 
composition of the oil. Furthermore, there is 
experimental evidence that, because of this specificity, 
given the correct composition of fatty acids, plants can 
produce cocoa butter substitutes (Bafor et al., (1990) 
J. Amer. Oil Chemists Soc. 67:217-225). 

Based on the above discussion, one approach to
altering the levels of palmitic, stearic and oleic acids 
in vegetable oils is by altering their levels in the 
cytoplasmic acyl-CoA pool used for oil biosynthesis. 

It should be possible to genetically modulate the
competition both between palmitoyl-ACP elongase and
acyl-ACP thioesterase and between stearoyl-ACP
desaturase and thioesterase by modulating the expression
level of thioesterase. While alteration of stearoyl-ACP
desaturase activity either upward or downward may change
the existing ratio of oleate:stearate and similarly
altered expression of palmitoyl-ACP elongase might lead
to new palmitate:(stearate + oleate) ratios, only
modification of the acyl-ACP thioesterase activity is
expected to change the amounts of both palmitate and
stearate with one genetic manipulation. Increased
competition leading to increased levels of palmitic and
stearic acids would result from over-expression of
cloned and re-introduced thioesterase genes, which is the
more proven technology, while decreased competition
leading to decreased total saturated fatty acid would
result from expression of antisense message from the
acyl-ACP thioesterase gene. The simultaneous and
opposite manipulation of the palmitoyl-ACP elongase and
stearoyl-ACP desaturase activities would be required to
achieve these same effects. There are thus two
advantages to the use of nucleotide sequences encoding
the acyl-ACP thioesterase to increase saturated fatty
acid content in vegetable oil over the manipulation of
the other two mentioned enzymes: 1) the manipulation
does not require antisense technology and 2) both the
palmitate and stearate levels should be elevated with
one genetic manipulation.
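
The competition argument above can be made concrete with a deliberately simple branch-point model. The following Python sketch assumes first-order competition at each of the two branch points (palmitoyl-ACP and stearoyl-ACP) and assumes that all oleoyl-ACP is eventually released. Every rate constant is a made-up illustrative number, not a measured value, and the model is not part of the disclosure. It only shows how scaling one enzyme's activity could shift palmitate and stearate in the same direction.

```python
def acyl_pool(te_expression, k_te16=1.0, k_elongase=8.0,
              k_te18=1.0, k_desaturase=20.0):
    """Toy model: fractions of (palmitate, stearate, oleate) leaving the ACP track.

    te_expression scales thioesterase activity on both substrates
    (1.0 = hypothetical wild type). All rate constants are arbitrary.
    """
    te16 = te_expression * k_te16
    te18 = te_expression * k_te18
    # Branch point 1: palmitoyl-ACP is either hydrolyzed or elongated.
    released_16 = te16 / (te16 + k_elongase)
    # Branch point 2: stearoyl-ACP is either hydrolyzed or desaturated.
    released_18 = te18 / (te18 + k_desaturase)
    palmitate = released_16
    stearate = (1.0 - released_16) * released_18
    oleate = (1.0 - released_16) * (1.0 - released_18)
    return palmitate, stearate, oleate


if __name__ == "__main__":
    for label, level in (("antisense (0.5x)", 0.5),
                         ("wild type (1x)", 1.0),
                         ("over-expression (2x)", 2.0)):
        p, s, o = acyl_pool(level)
        print(f"{label}: palmitate={p:.3f} stearate={s:.3f} oleate={o:.3f}")
```

In this toy setting, doubling thioesterase activity raises both the palmitate and stearate fractions and halving it lowers both, which is the qualitative behavior the text anticipates from a single genetic manipulation.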

Transfer of one or both of these nucleic acid
fragments of SEQ ID NOs:1 and 3 of the invention or a
part thereof that encodes a functional enzyme, with
suitable regulatory sequences into a living cell will
result in the production or over-production of acyl-ACP
thioesterase, which may result in increased levels of
palmitic and stearic acids in cellular lipids, including
oil.

Transfer of the nucleic acid fragment or fragments
of the invention, with suitable regulatory sequences
that transcribe the present cDNA, into a plant having an
endogenous seed acyl-ACP thioesterase substantially
homologous with the present cDNA may inhibit by
cosuppression the expression of the endogenous acyl-ACP
thioesterase gene and, consequently, result in a
decreased amount of palmitic and stearic acids in the
seed oil (Jorgenson, Trends Biotech. (1990) 340-344).

Transfer of the nucleic acid fragment or fragments
of the invention into a soybean plant with suitable
regulatory sequences that transcribe the antisense RNA
complementary to the mRNA, or its precursor, for seed
acyl-ACP thioesterase may inhibit the expression of the
endogenous acyl-ACP thioesterase gene and, consequently,
result in reduced amounts of palmitic and stearic acids
in the seed oil.

The nucleic acid fragments of the invention can 
also be used as restriction fragment length polymorphism 
(RFLP) markers in soybean genetic studies and breeding
programs. 

DEFINITIONS

In the context of this disclosure, a number of
terms shall be utilized. As used herein, the term
"nucleic acid" refers to a large molecule which can be
single stranded or double stranded, composed of monomers
(nucleotides) containing a sugar, phosphate and either a
purine or pyrimidine. A "nucleic acid fragment" is a
fraction of a given nucleic acid molecule. In higher
plants, deoxyribonucleic acid (DNA) is the genetic
material while ribonucleic acid (RNA) is involved in the
transfer of the information in DNA into proteins. A
"genome" is the entire body of genetic material
contained in each cell of an organism. The term
"nucleotide sequence" refers to a polymer of DNA or RNA
which can be single- or double-stranded, optionally
containing synthetic, non-natural or altered nucleotide
bases capable of incorporation into DNA or RNA polymers.
As used herein, the term "homologous to" refers to the
complementarity between the nucleotide sequence of two
nucleic acid molecules or between the amino acid
sequences of two protein molecules. Estimates of such
homology are provided by either DNA-DNA or DNA-RNA
hybridization under conditions of stringency as is well
understood by those skilled in the art (as described in
Hames and Higgins (eds.) Nucleic Acid Hybridisation, IRL
Press, Oxford, U.K. (1985)); or by the comparison of
sequence similarity between two nucleic acids or
proteins. As used herein, "substantially homologous"
refers to nucleic acid molecules which require less
stringent conditions of hybridization than those for
homologous sequences, and coding DNA sequence which may
involve base changes that do not cause a change in the
encoded amino acid, or which involve base changes which
may alter an amino acid, but not affect the functional
properties of the protein encoded by the DNA sequence.
Thus, the nucleic acid fragments described herein
include molecules which comprise possible variations of
the nucleotide bases derived from deletion,
rearrangement, random or controlled mutagenesis of the
nucleic acid fragment, and even occasional nucleotide
sequencing errors so long as the DNA sequences are
substantially homologous.
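
The idea of base changes that leave the encoded amino acids unchanged can be shown with a short worked example. The Python sketch below translates two hypothetical 15-base coding sequences with the standard genetic code and reports their nucleotide identity. The sequences and function names are invented for illustration. Percent identity is used here only as a simple stand-in for "sequence similarity" and says nothing about hybridization stringency.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence into one-letter amino acid codes."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def percent_identity(seq_a, seq_b):
    """Percent of matching positions between two equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Hypothetical reference and variant differing only at third codon positions.
REFERENCE = "ATGGCTCGTGTTGAA"
VARIANT = "ATGGCCCGCGTGGAG"

if __name__ == "__main__":
    print("nucleotide identity: %.1f%%" % percent_identity(REFERENCE, VARIANT))
    print("reference protein:", translate(REFERENCE))
    print("variant protein:  ", translate(VARIANT))
```

Here four of the fifteen bases differ, yet both sequences translate to the same five-residue peptide, which is the first kind of base change covered by the definition of "substantially homologous".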

"Gene" refers to a nucleic acid fragment that 
expresses a specific protein, including regulatory 
sequences preceding (5' non-coding) and following (3'
non-coding) the coding region. "Acyl-ACP thioesterase
gene" refers to a nucleic acid fragment that expresses a
protein with acyl-ACP thioesterase activity. "Native"
gene refers to the gene as found in nature with its own
regulatory sequences. "Chimeric" gene refers to a gene
that is comprised of heterogeneous regulatory and coding
sequences. "Endogenous" gene refers to the native gene
normally found in its natural location in the genome. A
"foreign" gene refers to a gene not normally found in
the host organism but that is introduced by gene
transfer.

"Coding sequence" refers to a DNA sequence that
codes for a specific protein and excludes the non-coding
sequences. It may constitute an "uninterrupted coding
sequence", i.e., lacking an intron, such as in a cDNA or
it may include one or more introns bounded by
appropriate splice junctions. An "intron" is a sequence
of RNA which is transcribed in the primary transcript
but which is removed through cleavage and re-ligation of
the RNA within the cell to create the mature mRNA that
can be translated into a protein.

"Initiation codon" and "termination codon" refer to 
a unit of three adjacent nucleotides in a coding 
sequence that specifies initiation and chain
termination, respectively, of protein synthesis (mRNA
translation). "Open reading frame" refers to the amino
acid sequence encoded between translation initiation and
termination codons of a coding sequence.
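
The definitions of initiation codon, termination codon and open reading frame can be illustrated by a short scan of a DNA string. The Python sketch below assumes the usual ATG initiation codon and the three standard termination codons, and it searches only the forward strand starting from the first ATG. The example sequence and the function name are hypothetical.

```python
START_CODON = "ATG"
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orf(dna):
    """Return (start, end) of the first ORF as a 0-based, half-open interval.

    The interval runs from the first ATG through the first in-frame
    termination codon (inclusive). Returns None if no complete ORF exists.
    """
    start = dna.find(START_CODON)
    if start == -1:
        return None
    for position in range(start, len(dna) - 2, 3):
        if dna[position:position + 3] in STOP_CODONS:
            return start, position + 3
    return None


if __name__ == "__main__":
    example = "GGCATGGCTTGGTAAGC"  # made-up sequence for illustration
    orf = find_orf(example)
    print("ORF interval:", orf)
    if orf is not None:
        print("ORF bases:", example[orf[0]:orf[1]])
```

For the made-up sequence shown, the scan reads the codons ATG, GCT, TGG and TAA in frame and stops at TAA.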

"RNA transcript" refers to the product resulting 
from RNA polymerase-catalyzed transcription of a DNA 

sequence. When the RNA transcript is a perfect
complementary copy of the DNA sequence, it is referred
to as the primary transcript or it may be a RNA sequence
derived from posttranscriptional processing of the
primary transcript and is referred to as the mature RNA.
"Messenger RNA" (mRNA) refers to the RNA that is without
introns and that can be translated into protein by the
cell. "cDNA" refers to a double-stranded DNA that is
complementary to and derived from mRNA. "Sense" RNA
refers to RNA transcript that includes the mRNA.
"Antisense RNA" refers to a RNA transcript that is
complementary to all or part of a target primary
transcript or mRNA and that blocks the expression of a
target gene by interfering with the processing,
transport and/or translation of its primary transcript
or mRNA. The complementarity of an antisense RNA may be
with any part of the specific gene transcript, i.e., at
the 5' non-coding sequence, 3' non-coding sequence,
introns, or the coding sequence. In addition, as used
herein, antisense RNA may contain regions of ribozyme
sequences that may increase the efficacy of antisense
RNA to block gene expression. "Ribozyme" refers to a
catalytic RNA and includes sequence-specific
endoribonucleases.
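
The complementarity that defines an antisense RNA amounts to taking the reverse complement of the target transcript. A minimal Python sketch follows, assuming simple Watson-Crick pairing (A with U, G with C) and a made-up nine-base mRNA fragment. The function name is illustrative only.

```python
# Watson-Crick complements for RNA bases.
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna):
    """Return the antisense strand (5' to 3') complementary to an mRNA (5' to 3')."""
    return mrna.translate(RNA_COMPLEMENT)[::-1]


if __name__ == "__main__":
    target = "AUGGCUCGU"  # hypothetical mRNA fragment
    print("target mRNA:  5'-%s-3'" % target)
    print("antisense RNA: 5'-%s-3'" % antisense_rna(target))
```

Applying the function twice returns the original sequence, which is a quick way to confirm that the two strands are fully complementary.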

As used herein, "suitable regulatory sequences" 
refer to nucleotide sequences located upstream (5'),
within, and/or downstream (3') to a coding sequence,
which control the transcription and/or expression of the
coding sequences, potentially in conjunction with the
protein biosynthetic apparatus of the cell. In
artificial DNA constructs regulatory sequences can also
control the transcription and stability of antisense
RNA.

"Promoter" refers to a DNA sequence in a gene, 
usually upstream (5') to its coding sequence, which 

controls the expression of the coding sequence by
providing the recognition for RNA polymerase and other
factors required for proper transcription. In
artificial DNA constructs promoters can also be used to
transcribe antisense RNA. Promoters may also contain
DNA sequences that are involved in the binding of
protein factors which control the effectiveness of
transcription initiation in response to physiological or
developmental conditions. It may also contain enhancer
elements. An "enhancer" is a DNA sequence which can
stimulate promoter activity. It may be an innate
element of the promoter or a heterologous element
inserted to enhance the level and/or tissue-specificity
of a promoter. "Constitutive promoters" refers to those
that direct gene expression in all tissues and at all
times. "Tissue-specific" or "development-specific"
promoters as referred to herein are those that direct
gene expression almost exclusively in specific tissues,
such as leaves or seeds, or at specific development
stages in a tissue, such as in early or late
embryogenesis, respectively.
The term "expression", as used herein, is intended
to mean the production of a functional end-product.
Expression or overexpression of the acyl-ACP
thioesterase genes involves transcription of the gene
and translation of the mRNA into precursor or mature
acyl-ACP thioesterase proteins. "Antisense inhibition"
refers to the production of antisense RNA transcripts
capable of preventing the expression of the target
protein. "Overexpression" refers to the production of a
gene product in transgenic organisms that exceeds levels
of production in normal or non-transformed organisms.
"Cosuppression" refers to the expression of a transgene
which has substantial homology to an endogenous gene
resulting in the suppression of expression of both the
ectopic and the endogenous gene.

"Altered expression" refers to the production of 
gene product(s) in transgenic organisms in amounts or
proportions that differ significantly from that activity
in comparable tissue (organ and of developmental type)
from wild-type organisms.

The "3' non-coding sequences" refers to the DNA
sequence portion of a gene that contains a
polyadenylation signal and any other regulatory signal
capable of affecting mRNA processing or gene expression.
The polyadenylation signal is usually characterized by
effecting the addition of polyadenylic acid tracts to
the 3' end of the mRNA precursor.

"Mature" protein refers to a functional acyl-ACP 
thioesterase enzyme without its transit peptide. 

"Precursor" protein refers to the mature protein with a
native or foreign transit peptide. "Transit" peptide
refers to the amino terminal extension of a polypeptide,
which is translated in conjunction with the polypeptide
forming a precursor peptide and which is required for
its uptake by plastids of a cell.

"Transformation" herein refers to the transfer of a 
foreign gene into the genome of a host organism and its 
genetically stable inheritance. "Restriction fragment 
length polymorphism" refers to different sized 
restriction fragment lengths due to altered nucleotide
sequences in or around variant forms of genes.
"Fertile" refers to plants that are able to propagate
sexually.

"Oil producing species" herein refers to plant 
species which produce and store triacylglycerol in
specific organs, primarily in seeds. Such species
include soybean, canola, sunflower, cotton, cocoa,
peanut, safflower and corn. The group also includes
non-agronomic species which are useful in developing
appropriate expression vectors such as tobacco and
Arabidopsis thaliana, and wild species which may be a
source of unique fatty acids.

Purification of Soybean Seed Acyl-ACP Thioesterase

In order to modulate the activity of acyl-ACP
thioesterase in the seed, it is essential to isolate or
purify the complete gene(s) or cDNA(s) encoding the
target enzyme(s).

Acyl-ACP thioesterase proteins were purified to a
protein mixture containing either two or three peptides
when analyzed by SDS polyacrylamide gel electrophoresis
(SDS-PAGE) starting from the soluble fraction of
extracts made from developing soybean seeds following
binding to DEAE-cellulose, ammonium sulfate
precipitation, chromatographic separation on blue
sepharose, high performance anion exchange, alkyl-ACP
sepharose, and phenyl-Superose. In a typical
preparation, the fold purification of thioesterase
activity was about 8500. The preparation runs as a
single band in native polyacrylamide gel
electrophoresis, and as a single, symmetrical peak in
gel filtration chromatography indicating a native
molecular weight of about 75 kD. SDS-PAGE of these
preparations showed a very minor peptide of about 39 kD
and two major peptides at about 33 and 34 kD.

Polyclonal antibodies raised to each of these
peptides individually in mice cross-reacted in all
combinations upon Western analysis indicating that the
peptides are antigenically very similar. All attempts
at separating these three peptides with retention of
thioesterase activity failed. The peptides at 33 and 34
kD could be separated from the 39 kD peptide by reverse
phase chromatography on a diphenyl matrix. The mixture
containing these two peptides was analyzed by N-terminal
amino acid sequencing and one main sequence of the
following amino acid order was found:

Arg-Val-Glu-Ala-Pro-Gly-Gly-Thr-Leu-Ala-Asp-Arg-Leu
(SEQ ID NO:5).

25 

These results lead to the conclusion that the
native thioesterase enzyme is a heterodimer of at least
three polypeptides with similar amino acid sequences and
nearly identical N-termini. Whether the mixture arises
as the result of the expression of slightly dissimilar
genes or as the result of heterogeneous proteolytic
processing at the carboxyl terminal of the product of
one gene or of identical genes is not known.

Cloning of Soybean Seed Acyl-ACP Thioesterase cDNA

The combined 33 and 34 kD peptides from reversed
phase purification were denatured and reduced with
dithiothreitol (DTT), then alkylated with vinyl
pyridine. The derivatized protein was desalted,
lyophilized and subjected to CNBr cleavage in 70%
trifluoroacetic acid (TFA) solution. Peptide fragments
produced by CNBr were separated by SDS-PAGE,
electrophoretically transferred onto Immobilon®-P
membrane and stained with non-acid Coomassie blue.
Three main peptides of 28, 16 and 14 kD
were observed and cut from the blot for N-terminal
sequencing. Of these, the peptide of 14 kD gave the
following amino acid sequence from its N-terminal:

Ile-Glu-Ile-Tyr-Lys-Tyr-Pro-Ala-Trp-Leu-Asp-Ile-Val-Glu-Ile
(SEQ ID NO:6).

Based on this sequence from the first Ile to the
first two bases of the codon for the last Ile, a set of
eight degenerate 41 nucleotide-long oligonucleotides was
synthesized. The design took into account the codon
usage in selected soybean seed genes and used six
deoxyinosines at positions of ambiguity. The probe,
following radiolabeling, was used to screen a cDNA
expression library made in Lambda Zap vector from
poly A+ RNA from 20-day-old developing soybean seeds. Five
positively hybridizing plaques were subjected to plaque
purification. Sequences of the pBluescript (Stratagene)
vector, including the cDNA inserts, from each of the
purified phage stocks were excised in the presence of a
helper phage and the resultant phagemids used to infect
E. coli cells resulting in double-stranded plasmids,
p22A, p22B, p23A, p25A, and p25B.
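
The need for codon-usage bias and deoxyinosine substitutions in the
probe design can be illustrated by counting how many distinct coding
sequences could specify the sequenced peptide. The following is a
minimal sketch, assuming the standard genetic code; the function name
backtranslation_degeneracy is hypothetical, and the calculation is
illustrative only and is not a reconstruction of the actual probe set.

```python
# Number of codons per amino acid in the standard genetic code.
CODONS_PER_RESIDUE = {
    "Ala": 4, "Arg": 6, "Asn": 2, "Asp": 2, "Cys": 2, "Gln": 2, "Glu": 2,
    "Gly": 4, "His": 2, "Ile": 3, "Leu": 6, "Lys": 2, "Met": 1, "Phe": 2,
    "Pro": 4, "Ser": 6, "Thr": 4, "Trp": 1, "Tyr": 2, "Val": 4,
}

# N-terminal sequence of the 14 kD CNBr peptide (SEQ ID NO:6).
PEPTIDE = "Ile-Glu-Ile-Tyr-Lys-Tyr-Pro-Ala-Trp-Leu-Asp-Ile-Val-Glu-Ile"


def backtranslation_degeneracy(peptide):
    """Count the nucleotide sequences that could encode the peptide."""
    total = 1
    for residue in peptide.split("-"):
        total *= CODONS_PER_RESIDUE[residue]
    return total


print(backtranslation_degeneracy(PEPTIDE))  # 1990656
```

With roughly two million possible coding sequences for the full
peptide, a fully degenerate probe would be impractical, which is
consistent with the reduction to a set of eight oligonucleotides
described above.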

The cDNA insert in plasmid p22B is flanked at both
ends by the two EcoRI sites introduced by the cDNA
construction and its cloning into the vector
pBluescript. The nucleotide sequence of the cDNA insert
in p22B encodes a 367 amino acid open reading frame that
includes the N-terminal sequence found in the purified
protein at the fifty-sixth amino acid of the open
reading frame. Thus the first fifty-five amino acids
are presumably the transit peptide required for import
of the precursor protein into the plastid. The
methionine codon at base number 106 of p22B is the
apparent start methionine since a) it is the first
methionine after the last stop codons 5' to and in frame
with the N-terminal sequence and, b) the N-terminal
methionine in all but one known chloroplast transit
peptides is followed by alanine. Thus, it can be
deduced that the acyl-ACP thioesterase precursor protein
encoded by this gene consists of a fifty-five amino acid
transit peptide and a 312 amino acid mature protein
before any further proteolytic processing occurs. A
fusion protein comprising the first sixteen amino acids
of β-galactosidase, and beginning at the fourth amino
acid of the mature soybean seed acyl-ACP thioesterase in
an appropriate plasmid is expressed in E. coli and is
catalytically functional.
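
The coordinates implied by this description can be worked out
directly. The sketch below assumes that base number 106 is the A of
the initiating ATG and that nucleotide and amino acid positions are
1-based; the derived positions are arithmetic consequences of those
assumptions and are not values read from the sequence listing.

```python
START_BASE = 106      # A of the apparent start ATG in p22B (1-based)
PRECURSOR_AA = 367    # length of the open reading frame in amino acids
TRANSIT_AA = 55       # residues preceding the mature N-terminus


def first_base_of_codon(aa_index, start_base=START_BASE):
    """1-based position of the first base of codon aa_index (1-based)."""
    return start_base + 3 * (aa_index - 1)


mature_aa = PRECURSOR_AA - TRANSIT_AA
mature_start_base = first_base_of_codon(TRANSIT_AA + 1)
last_coding_base = first_base_of_codon(PRECURSOR_AA) + 2

print(mature_aa)          # 312
print(mature_start_base)  # 271
print(last_coding_base)   # 1206
```

Under these assumptions the mature protein would begin at base 271 and
the last sense codon would end at base 1206, upstream of the
restriction sites near the 3' end of the insert.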

The entire cDNA insert in p22B was cut from the
Bluescript plasmid, radiolabeled and used as a probe for
additional thioesterase genes in the soybean seed cDNA
library. Five additional clones were characterized. Of
these, one is identical to clone 22B from one hundred
bases before the stop codon of the open reading frame
and through the 3' non-coding region. The other four
appear to be identical to each other, but differ from
22B. One of these clones (4C) was sequenced completely
and is shown in SEQ ID NO:3. The open reading frame on
the cDNA encodes a thioesterase precursor protein which
is again 367 amino acids in length and which, at the
amino acid level, is 97% identical to the thioesterase
encoded by insert 22B. Both the 5' and 3' non-coding
sequences of the two genes diverge in identity as the
distance from the open reading frame increases.

The fragments of the instant invention may be used,
if desired, to isolate substantially homologous acyl-ACP
thioesterase cDNAs and genes, including those from plant
species other than soybean. Isolation of homologous
genes is well-known in the art. Southern blot analysis
reveals that the soybean cDNA for the enzyme hybridizes
to several, different-sized DNA fragments in the genomic
DNA of tomato, rapeseed (Brassica napus), soybean,
sunflower and Arabidopsis (which has a very simple
genome). Although the number of different genes or
"pseudogenes" (non-functional genes) present in any
plant is unknown, it is expected to be more than one
since acyl-ACP thioesterase is an important enzyme.
Moreover, plants that are amphidiploid (that is, derived
from two progenitor species), such as soybean, rapeseed
(B. napus), and tobacco will have genes from both
progenitor species.

Overexpression of the Enzyme in Transgenic Species

The nucleic acid fragment of the instant invention
encoding soybean seed acyl-ACP thioesterase cDNA, or a
coding sequence derived from other cDNAs or genes for
the enzyme, with suitable regulatory sequences, can be
used to overexpress the enzyme in transgenic soybean as
well as other transgenic species. Such a recombinant
DNA construct may include either the native acyl-ACP
thioesterase gene or a chimeric gene. One skilled in
the art can isolate the coding sequences from the
fragment of the invention by using and/or creating sites
for restriction endonucleases, as described in Sambrook
et al., Molecular Cloning: A Laboratory Manual, 2nd Ed.
(1989), Cold Spring Harbor Laboratory Press. Of
particular utility are sites for Nco I (5'-CCATGG-3')
and Sph I (5'-GCATGC-3') that allow precise removal of
coding sequences starting with the initiating codon ATG.
For isolating the coding sequence of acyl-ACP
thioesterase precursor from the fragment of the invention,
an Nco I site can be engineered by substituting
nucleotide A at position 105 with C. Cutting at this
engineered site (or alternatively at an existing Hind
III (5'-AAGCTT-3') site beginning at base pair 93 of
p22B) along with cuts at restriction endonuclease sites
near the 3' end of p22B such as the Spi I at 1339 or the
Xmn I site at 1562 allows removal of the fragment
encoding the acyl-ACP thioesterase precursor protein and
directional re-insertion into a properly designed
vector.
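
The effect of a single-base substitution of this kind can be checked
in silico before any mutagenesis is attempted. The following is a
minimal sketch; the 12-base sequence is a hypothetical context around
a start codon and is NOT the p22B sequence, and the helper names
substitute and site_positions are illustrative only.

```python
NCO_I = "CCATGG"  # Nco I recognition sequence; the ATG lies at bases 3-5


def substitute(seq, position, base):
    """Return seq with the base at the 1-based position replaced."""
    return seq[: position - 1] + base + seq[position:]


def site_positions(seq, site):
    """Return the 1-based start position of every occurrence of site."""
    width = len(site)
    return [i + 1 for i in range(len(seq) - width + 1)
            if seq[i:i + width] == site]


# Hypothetical context: the ATG starts at position 6 and is preceded by CA.
toy = "GTTCAATGGCTT"
mutant = substitute(toy, 5, "C")  # replace the A just before the ATG with C

print(site_positions(toy, NCO_I))     # []
print(site_positions(mutant, NCO_I))  # [4]
```

In this toy case the substitution one base upstream of the ATG creates
a site that starts two bases upstream of the ATG, which parallels the
substitution at position 105 next to the start codon at base 106.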

Inhibition of Plant Target
Genes by Use of Antisense RNA

Antisense RNA has been used to inhibit plant target
genes in a dominant and tissue-specific manner (see
van der Krol et al., Gene (1988) 72:45-50; Ecker et al.,
Proc. Natl. Acad. Sci. USA (1986) 83:5372-5376;
van der Krol et al., Nature (1988) 336:866-869; Smith
et al., Nature (1988) 334:724-726; Sheehy et al., Proc.
Natl. Acad. Sci. USA (1988) 85:8805-8809; Rothstein
et al., Proc. Natl. Acad. Sci. USA (1987) 84:8439-8443;
Cornelissen et al., Nucl. Acids Res. (1988) 17:833-843;
Cornelissen, Nucl. Acid Res. (1989) 17:7203-7209; Robert
et al., Plant Mol. Biol. (1989) 13:399-409; Cannon
et al., Plant Molec. Biol. (1990) 15:39-47).

The use of antisense inhibition of the seed enzyme
would require isolation of the coding sequence for genes
that are expressed in the target tissue of the target
plant. Thus, it will be more useful to use the fragment
of the invention to screen seed-specific cDNA libraries,
rather than genomic libraries or cDNA libraries from
other tissues, from the appropriate plant for such
sequences. Moreover, since there may be more than one
gene encoding seed acyl-ACP thioesterase, it may be
useful to isolate the coding sequences from the other
genes from the appropriate crop. The genes that are
most highly expressed are the best targets for antisense
inhibition. The level of transcription of different
genes can be studied by known techniques, such as
nuclear run-off transcription.

There have been examples of using the entire cDNA
sequence for antisense inhibition (Sheehy et al., Proc.
Natl. Acad. Sci. USA (1988) 85:8805-8809). Thus, for
expressing antisense RNA in soybean seed from the
fragment of the invention, the entire fragment of the
invention (that is, the entire cDNA for soybean seed
acyl-ACP thioesterase within the restriction sites
described above) may be used. There is also evidence
that the 3' non-coding sequences (Ch'ng et al., Proc.
Natl. Acad. Sci. USA (1989) 86:10006-10010) or short
fragments of 5' coding sequence (as few as 41 base pairs
of a 1.87 kb cDNA) (Cannon et al., Plant Molec. Biol.
(1990) 15:39-47) can play an important role in antisense
inhibition. Thus, for expressing antisense RNA in
soybean seed from the fragment of the invention, a small
fragment of the invention, consisting of at least 41
base pairs of the acyl-ACP thioesterase cDNA, may also
be used.
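
An antisense construct places the chosen cDNA fragment in reverse
orientation relative to the promoter, so the strand read in the sense
direction of the construct is the reverse complement of the cDNA
fragment. The following is a minimal sketch of that operation with a
length check against the 41 base pair figure cited above; the function
name antisense_insert and the toy fragment are hypothetical and do not
represent the thioesterase sequence.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")
MIN_ANTISENSE_BP = 41  # shortest effective fragment cited in the text


def antisense_insert(cdna_fragment):
    """Return the reverse complement of a cDNA fragment of sufficient length."""
    if len(cdna_fragment) < MIN_ANTISENSE_BP:
        raise ValueError("fragment is shorter than 41 base pairs")
    return cdna_fragment.translate(COMPLEMENT)[::-1]


# Hypothetical 42-base fragment used only to exercise the function.
toy_fragment = "ATGGCT" * 7
print(antisense_insert(toy_fragment))
```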

Inhibition of Plant Target
Genes by Cosuppression

The phenomenon of cosuppression has also been used
to inhibit plant target genes in a dominant and
tissue-specific manner (Napoli et al., The Plant Cell (1990)
2:279-289; van der Krol et al., The Plant Cell (1990)
2:291-299; Smith et al., Mol. Gen. Genetics (1990) 224:
477-481). The nucleic acid fragment of the instant
invention encoding soybean seed acyl-ACP thioesterase
cDNA, or a coding sequence derived from other cDNAs or
genes for the enzyme, along with suitable regulatory
sequences, can be used to reduce the level of the enzyme
in a transgenic oilseed plant which contains an
endogenous gene substantially homologous to the
introduced acyl-ACP thioesterase cDNA. The experimental
procedures necessary for this are similar to those
described above for sense overexpression of the
acyl-ACP thioesterase cDNA. Cosuppressive inhibition of
an endogenous gene using the entire cDNA sequence
(Napoli et al., The Plant Cell (1990) 2:279-289; van der
Krol et al., The Plant Cell (1990) 2:291-299) and also
using part of a gene (730 bp of a 1770 bp cDNA) (Smith
et al., Mol. Gen. Genetics (1990) 224:477-481) are
known. Thus, all or part of the nucleotide sequence of
the present acyl-ACP thioesterase cDNA may be used to
reduce the levels of acyl-ACP thioesterase enzyme in a
transgenic oilseed.

Selection of Hosts, Promoters and Enhancers

A preferred class of heterologous hosts for the
expression of the coding sequence of acyl-ACP
thioesterase precursor or the antisense RNA are
eukaryotic hosts, particularly the cells of higher
plants. Particularly preferred among the higher plants
are the oilcrops, such as soybean (Glycine max),
rapeseed (Brassica napus, B. campestris), sunflower
(Helianthus annuus), cotton (Gossypium hirsutum), corn
(Zea mays), cocoa (Theobroma cacao), and peanut (Arachis
hypogaea). Expression in plants will use regulatory
sequences functional in such plants.

The expression of foreign genes in plants is
well-established (De Blaere et al., Meth. Enzymol. (1987)
153:277-291). The origin of promoter chosen to drive
the expression of the coding sequence or the antisense
RNA is not critical provided it has sufficient
transcriptional activity to accomplish the invention by
increasing or decreasing, respectively, the level of
translatable mRNA for acyl-ACP thioesterase in the
desired host tissue. Preferred promoters include (a)
strong constitutive plant promoters, such as those
directing the 19S and 35S transcripts in Cauliflower
mosaic virus (Odell et al., Nature (1985) 313:810-812;
Hull et al., Virology (1987) 86:482-493), and (b)
tissue- or developmentally-specific promoters. Examples
of tissue-specific promoters are the light-inducible
promoter of the small subunit of ribulose
1,5-bisphosphate carboxylase if expression is desired in
photosynthetic tissues, maize zein protein (Matzke
et al., EMBO J. (1984) 3:1525), and chlorophyll a/b
binding protein (Lampa et al., Nature (1986)
316:750-752).

Particularly preferred promoters are those that
allow seed-specific expression. This may be especially
useful since seeds are the primary source of vegetable
oils and also since seed-specific expression will avoid
any potential deleterious effect in non-seed tissues.
Examples of seed-specific promoters include, but are not
limited to, the promoters of seed storage proteins,
which can represent up to 90% of total seed protein in
many plants. The seed storage proteins are strictly
regulated, being expressed almost exclusively in seeds
in a highly tissue-specific and stage-specific manner
(Higgins et al., Ann. Rev. Plant Physiol. (1984)
35:191-221; Goldberg et al., Cell (1989) 56:149-160).
Moreover, different seed storage proteins may be
expressed at different stages of seed development.

Expression of seed-specific genes has been studied
in great detail (see reviews by Goldberg et al., Cell
(1989) 56:149-160 and Higgins et al., Ann. Rev. Plant
Physiol. (1984) 35:191-221). There are currently
numerous examples for seed-specific expression of seed
storage protein genes in transgenic dicotyledonous
plants. These include genes from dicotyledonous plants
for bean β-phaseolin (Sengupta-Gopalan et al., Proc.
Natl. Acad. Sci. USA (1985) 82:3320-3324; Hoffman
et al., Plant Mol. Biol. (1988) 11:717-729), bean lectin
(Voelker et al., EMBO J. (1987) 6:3571-3577), soybean
lectin (Okamuro et al., Proc. Natl. Acad. Sci. USA
(1986) 83:8240-8244), soybean Kunitz trypsin inhibitor
(Perez-Grau et al., Plant Cell (1989) 1:1095-1109),
soybean β-conglycinin (Beachy et al., EMBO J. (1985)
4:3047-3053), pea vicilin (Higgins et al., Plant Mol.
Biol. (1988) 11:683-695), pea convicilin (Newbigin
et al., Planta (1990) 180:461), pea legumin (Shirsat
et al., Mol. Gen. Genetics (1989) 215:326), and rapeseed
napin (Radke et al., Theor. Appl. Genet. (1988)
75:685-694), as well as genes from monocotyledonous plants such
as for maize 15 kD zein (Hoffman et al., EMBO J. (1987)
6:3213-3221), maize 18 kD oleosin (Lee et al., Proc.
Natl. Acad. Sci. USA (1991) 88:6181-6185), barley
β-hordein (Marris et al., Plant Mol. Biol. (1988)
10:359-366) and wheat glutenin (Colot et al., EMBO J.
(1987) 6:3559-3564). Moreover, promoters of
seed-specific genes operably linked to heterologous coding
sequences in chimeric gene constructs also maintain
their temporal and spatial expression pattern in
transgenic plants. Such examples include the Arabidopsis
thaliana 2S seed storage protein gene promoter to
express enkephalin peptides in Arabidopsis and B. napus
seeds (Vandekerckhove et al., Bio/Technology (1989)
7:929-932), bean lectin and bean β-phaseolin promoters
to express luciferase (Riggs et al., Plant Sci. (1989)
63:47-57), and wheat glutenin promoters to express
chloramphenicol acetyl transferase (Colot et al., EMBO
J. (1987) 6:3559-3564).

Of particular use in the expression of the nucleic
acid fragment of the invention will be the heterologous
promoters from several soybean seed storage protein
genes such as those for the Kunitz trypsin inhibitor
(Jofuku et al., Plant Cell (1989) 1:1079-1093), glycinin
(Nielson et al., Plant Cell (1989) 1:313-328), and
β-conglycinin (Harada et al., Plant Cell (1989)
1:415-425). Promoters of genes for α- and β-subunits of
soybean β-conglycinin storage protein will be
particularly useful in expressing the mRNA or the
antisense RNA to acyl-ACP thioesterase in the cotyledons
at mid- to late-stages of seed development (Beachy et
al., EMBO J. (1985) 4:3047-3053) in transgenic plants.
This is because there is very little position effect on
their expression in transgenic seeds, and the two
promoters show different temporal regulation: the
promoter for the α-subunit gene is expressed a few
days before that for the β-subunit gene. This is
important for transforming rapeseed where oil
biosynthesis begins about a week before seed storage
protein synthesis (Murphy et al., J. Plant Physiol.
(1989) 135:63-69).

Also of particular use will be promoters of genes
expressed during early embryogenesis and oil
biosynthesis. The native regulatory sequences,
including the native promoter, of the acyl-ACP
thioesterase gene expressing the nucleic acid fragment
of the invention can be used following its isolation by
those skilled in the art. Heterologous promoters from
other genes involved in seed oil biosynthesis, such as
those for B. napus isocitrate lyase and malate synthase
(Comai et al., Plant Cell (1989) 1:293-300), Arabidopsis
ACP (Post-Beittenmiller et al., Nucl. Acids Res. (1989)
17:1777), B. napus ACP (Safford et al., Eur. J. Biochem.
(1988) 174:287-295), B. campestris ACP (Rose et al.,
Nucl. Acids Res. (1987) 15:7197), and Zea mays oleosin
(Lee et al., Proc. Natl. Acad. Sci. USA (1991)
88:6181-6185) may also be used. The genomic DNA sequence for
B. napus oleosin is also published (Lee et al., Plant
Physiol. (1991) 96:1395-1397) and one skilled in the art
can use this sequence to isolate the corresponding
promoter. The partial protein sequences for the
relatively-abundant enoyl-ACP reductase and acetyl-CoA
carboxylase are published (Slabas et al., Biochim.
Biophys. Acta (1987) 877:271-280; Cottingham et al.,
Biochim. Biophys. Acta (1988) 954:201-207) and one
skilled in the art can use these sequences to isolate
the corresponding seed genes with their promoters.

Attaining the proper level of expression of
acyl-ACP thioesterase mRNA or antisense RNA may require the
use of different chimeric genes utilizing different
promoters. Such chimeric genes can be transferred into
host plants either together in a single expression
vector or sequentially using more than one vector.

It is envisioned that the introduction of enhancers
or enhancer-like elements into either the native
acyl-ACP thioesterase promoter or into other promoter
constructs will also provide increased levels of primary
transcription for antisense RNA or mRNA for acyl-ACP
thioesterase to accomplish the invention. This would
include viral enhancers such as that found in the 35S
promoter (Odell et al., Plant Mol. Biol. (1988)
10:263-272), enhancers from the opine genes (Fromm et al.,
Plant Cell (1989) 1:977-984), or enhancers from any
other source that result in increased transcription when
placed into a promoter operably linked to the nucleic
acid fragment of the invention.

Of particular importance is the DNA sequence
element isolated from the gene for the α-subunit of
β-conglycinin that can confer 40-fold seed-specific
enhancement to a constitutive promoter (Chen et al.,
Dev. Genet. (1989) 10:112-122). One skilled in the art
can readily isolate this element and insert it within
the promoter region of any gene in order to obtain
seed-specific enhanced expression with the promoter in
transgenic plants. Insertion of such an element in any
seed-specific gene that is expressed at different times
than the β-conglycinin gene will result in expression in
transgenic plants for a longer period during seed
development.

The invention can also be accomplished by a variety
of other methods to obtain the desired end. In one
form, the invention is based on modifying plants to
produce increased levels of acyl-ACP thioesterase by
virtue of having significantly larger numbers of copies
of the acyl-ACP thioesterase gene product. This may
result in sufficient increases in acyl-ACP thioesterase
activity levels to accomplish the invention.

Any 3' non-coding region capable of providing a
polyadenylation signal and other regulatory sequences
that may be required for the proper expression of the
acyl-ACP thioesterase coding region can be used to
accomplish the invention. This would include the native
3' end of the substantially homologous soybean acyl-ACP
thioesterase gene(s), the 3' end from a heterologous
acyl-ACP thioesterase, the 3' end from viral genes such
as the 3' end of the 35S or the 19S cauliflower mosaic
virus transcripts, the 3' end from the opine synthesis
genes, the 3' ends of ribulose 1,5-bisphosphate
carboxylase or chlorophyll a/b binding protein, or 3'
end sequences from any source such that the sequence
employed provides the necessary regulatory information
within its nucleic acid sequence to result in the proper
expression of the promoter/acyl-ACP thioesterase coding
region combination to which it is operably linked.
There are numerous examples in the art that teach the
usefulness of different 3' non-coding regions.

Transformation Methods

Various methods of transforming cells of higher
plants according to the present invention are available
to those skilled in the art (see EPO Pub. 0 295 959 A2
and 0 318 341 A1). Such methods include those based on
transformation vectors based on the Ti and Ri plasmids
of Agrobacterium spp. It is particularly preferred to
use the binary type of these vectors. Ti-derived
vectors transform a wide variety of higher plants,
including monocotyledonous and dicotyledonous plants
(Sukhapinda et al., Plant Mol. Biol. (1987) 8:209-216;
Potrykus, Mol. Gen. Genet. (1985) 199:183). Other
transformation methods are available to those skilled in
the art, such as direct uptake of foreign DNA constructs
(see EPO Pub. 0 295 959 A2), techniques of
electroporation (Fromm et al., Nature (1986) (London)
319:791) or high-velocity ballistic bombardment with
metal particles coated with the nucleic acid constructs
(Klein et al., Nature (1987) (London) 327:70). Once
transformed, the cells can be regenerated by those
skilled in the art.



WO 92/11373 PCT/US91/09160 


Of particular relevance are the recently described
methods to transform foreign genes into commercially
important crops, such as rapeseed (De Block et al.,
Plant Physiol. (1989) 91:694-701), sunflower (Everett
et al., Bio/Technology (1987) 5:1201), soybean
(Christou et al., Proc. Natl. Acad. Sci. USA (1989)
86:7500-7504) and corn (Fromm et al., (1990)
Bio/Technology 8:833-839).

Application to RFLP Technology

The use of restriction fragment length polymorphism
(RFLP) markers in plant breeding has been
well-documented in the art (Tanksley et al., Bio/Technology
(1989) 7:257-264). The nucleic acid fragment of the
invention indicates two gene copies by Southern
blotting. Both of these have been mapped on a soybean
RFLP map (Tingey et al., J. Cell Biochem. (1990),
Supplement 14E p. 291, abstract R153) and can be used as
RFLP markers for traits linked to these mapped loci.
These traits will include altered levels of palmitic,
stearic and oleic acid. The nucleic acid fragment of
the invention can also be used to isolate the acyl-ACP
thioesterase gene from variant (including mutant)
soybeans with altered stearic acid levels. Sequencing
of these genes will reveal nucleotide differences from
the normal gene that cause the variation. Short
oligonucleotides designed around these differences may
be used as hybridization probes to follow the variation
in stearic, palmitic and oleic acids. Oligonucleotides
based on differences that are linked to the variation
may be used as molecular markers in breeding these
variant oil traits.

EXAMPLE 1

ISOLATION OF cDNA FOR
SOYBEAN SEED ACYL-ACP THIOESTERASE

PREPARATION OF RADIOLABELED PALMITOYL, STEAROYL AND
OLEOYL-ACP

To frozen E. coli cell paste (0.5 kg of 1/2 log
phase growth of E. coli B grown on minimal media and
obtained from Grain Processing Corp, Muscatine, IA) was
added 50 mL of a solution 1 M in Tris, 1 M in glycine,
and 0.25 M in EDTA. Ten mL of 1 M MgCl2 was added and
the suspension was thawed in a water bath at 50°C. As
the suspension approached 37°C it was transferred to a
37°C bath, made to 10 mM in 2-mercaptoethanol and 20 mg
of DNAse and 50 mg of lysozyme were added. The
suspension was stirred for 2 h, then sheared by three 20
sec bursts in a Waring Blendor. The volume was adjusted
to 1 L and the mixture was centrifuged at 24,000xg for
30 min. The resultant supernatant was centrifuged at
90,000xg for 2 h. The resultant high-speed pellet was
saved for extraction of acyl-ACP synthase (see below)
and the supernatant was adjusted to pH 6.1 by the
addition of acetic acid. The extract was then made to
50% in 2-propanol by the slow addition of cold
2-propanol to the stirred solution at 0°C. The
resulting precipitate was allowed to settle for 2 h and
then removed by centrifugation at 16,000xg. The
resultant supernatant was adjusted to pH 6.8 with KOH
and applied at 2 mL/min to a 4.4 x 12 cm column of
DEAE-Sephacel which had been equilibrated in 10 mM MES, pH
6.8. The column was washed with 10 mM MES, pH 6.8 and
eluted with 1 L of a gradient of LiCl from 0 to 1.7 M in
the same buffer. Twenty mL fractions were collected and
the location of eluted ACP was determined by applying
10 µL of every second fraction to a lane of a native
polyacrylamide (20% acrylamide) gel electrophoresis
(PAGE). Fractions eluting at about 0.7 M LiCl contained
nearly pure ACP and were combined, dialyzed overnight
against water and then lyophilized.
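
The approximate position of the ACP peak in the fraction series can be
estimated from the gradient parameters. The sketch below assumes a
linear gradient, no column dead volume, and fractions numbered from 1
at the start of the gradient; none of these assumptions is stated in
the procedure, and the function names are hypothetical.

```python
GRADIENT_ML = 1000.0   # total gradient volume (1 L)
START_M = 0.0          # LiCl concentration at the start of the gradient
END_M = 1.7            # LiCl concentration at the end of the gradient
FRACTION_ML = 20.0     # volume of each collected fraction


def gradient_concentration(volume_ml):
    """LiCl molarity after volume_ml of an assumed linear gradient."""
    return START_M + (END_M - START_M) * volume_ml / GRADIENT_ML


def fraction_at_concentration(target_m):
    """1-based number of the fraction in which target_m is reached."""
    volume_ml = (target_m - START_M) / (END_M - START_M) * GRADIENT_ML
    return int(volume_ml // FRACTION_ML) + 1


print(int(GRADIENT_ML / FRACTION_ML))   # 50 fractions in total
print(fraction_at_concentration(0.7))   # 21
```

Under these assumptions, ACP eluting at about 0.7 M LiCl would be
expected near fraction 21 of 50, so sampling every second fraction
should still bracket the peak.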

Purification of Acyl-ACP Synthase 

Membrane pellets resulting from the high-speed
centrifugation described above were homogenized in 360
mL of 50 mM Tris-Cl, pH 8.0, and 0.5 M in NaCl and then
centrifuged at 80,000xg for 90 min. The resultant
supernatant was discarded and the pellets resuspended in
50 mM Tris-Cl, pH 8.0, to a protein concentration of 12
mg/mL. The membrane suspension was made to 2% in Triton
X-100 and 10 mM in MgCl2, and stirred at 0°C for 20 min
before centrifugation at 80,000xg for 90 min. The
protein in the resultant supernatant was diluted to
5 mg/mL with 2% Triton X-100 in 50 mM Tris-Cl, pH 8.0
and, then, made to 5 mM ATP by the addition of solid ATP
(disodium salt) along with an equimolar amount of
NaHCO3. The solution was warmed in a 55°C bath until
the internal temperature reached 53°C and was then
maintained at between 53°C and 55°C for 5 min. After
5 min the solution was rapidly cooled on ice and
centrifuged at 15,000xg for 15 min. The supernatant
from the heat treatment step was loaded directly onto a
column of 7 mL Blue Sepharose 4B which had been
equilibrated in 50 mM Tris-Cl, pH 8.0, and 2% Triton
X-100. The column was washed with 5 volumes of the
loading buffer, then 5 volumes of 0.6 M NaCl in the same
buffer and the activity was eluted with 0.5 M KSCN in
the same buffer. Active fractions were assayed for the
synthesis of acyl-ACP, as described below, combined, and
bound to 3 mL settled-volume of hydroxylapatite
equilibrated in 50 mM Tris-Cl, pH 8.0, 2% Triton X-100.
The hydroxylapatite was collected by centrifugation and
washed twice with 20 mL of 50 mM Tris-Cl, pH 8.0, 2%
Triton X-100. The activity was eluted with two 5 mL
washes of 0.5 M potassium phosphate, pH 7.5, 2% Triton
X-100. The first wash contained 66% of the activity and
it was concentrated with a 30 kD membrane filtration
concentrator (Amicon) to 1.5 mL.

Synthesis of Radiolabeled Acyl-ACP

Solutions of [3H]-palmitic acid, [14C]-stearic
acid and [14C]-oleic acid (120 nmol each) prepared in
methanol were dried in glass reaction vials. The ACP
preparation described above (1.15 mL, 32 nmol) was added
along with 0.1 mL of 0.1 M ATP, 0.05 mL of 80 mM DTT,
0.1 mL of 8 M LiCl, and 0.2 mL of 13% Triton X-100 in
0.5 M Tris-Cl, pH 8.0, with 0.1 M MgCl2. The reaction
was mixed thoroughly and 0.3 mL of the acyl-ACP synthase
preparation was added and the reaction was incubated at
37°C. At 0.5 h intervals a 10 µL aliquot was taken
and dried on a small filter paper disc. The disc was
washed extensively with chloroform:methanol:acetic acid
(8:2:1, v:v:v) and radioactivity retained on the disc
was taken as a measure of stearoyl-ACP. At 2 h about
88% of the ACP had been consumed. The reaction mixes
were diluted 1 to 4 with 20 mM Tris-Cl, pH 8.0, and
applied to 1 mL DEAE-Sephacel columns equilibrated in
the same buffer. The columns were washed in sequence
with 5 mL of 20 mM Tris-Cl, pH 8.0, 5 mL of 80%
2-propanol in 20 mM Tris-Cl, pH 8.0, and eluted with 0.5 M
LiCl in 20 mM Tris-Cl, pH 8.0. The column eluates were
passed directly onto 3 mL columns of octyl-sepharose
CL-4B which were washed with 10 mL of 20 mM potassium
phosphate, pH 6.8, and then eluted with 35% 2-propanol
in 2 mM potassium phosphate, pH 6.8. The eluted
products were lyophilized and redissolved at a
concentration of 24 µM.
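
The final composition of the acylation reaction follows from the
volumes listed above. The sketch below assumes that the volumes are
additive, that the dried fatty acid contributes no volume, and that
components carried in with the ACP and synthase preparations can be
ignored; the dictionary keys and helper name are hypothetical labels.

```python
# Volumes (mL) of each addition to one acylation reaction.
COMPONENTS_ML = {
    "ACP preparation": 1.15,
    "ATP, 0.1 M": 0.1,
    "DTT, 80 mM": 0.05,
    "LiCl, 8 M": 0.1,
    "Triton/Tris/MgCl2 buffer": 0.2,
    "acyl-ACP synthase": 0.3,
}
TOTAL_ML = sum(COMPONENTS_ML.values())

ACP_NMOL = 32.0             # ACP added to the reaction
FRACTION_CONSUMED = 0.88    # ACP consumed at 2 h


def final_mM(stock_mM, volume_ml):
    """Final millimolar concentration of a stock after dilution."""
    return stock_mM * volume_ml / TOTAL_ML


acyl_acp_nmol = ACP_NMOL * FRACTION_CONSUMED

print(f"total volume: {TOTAL_ML:.2f} mL")
print(f"ATP:  {final_mM(100.0, 0.1):.2f} mM")
print(f"DTT:  {final_mM(80.0, 0.05):.2f} mM")
print(f"LiCl: {final_mM(8000.0, 0.1):.0f} mM")
print(f"acyl-ACP formed: {acyl_acp_nmol:.1f} nmol")
```

Under these assumptions the reaction volume is about 1.9 mL, with
roughly 5.3 mM ATP, 2.1 mM DTT and 0.42 M LiCl, and about 28 nmol of
acyl-ACP would be formed from 32 nmol of ACP.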

PREPARATION OF ALKYL-ACP AFFINITY COLUMN

Synthesis of N-hexadecyliodoacetamide

1-Hexadecylamine (3.67 mmol) was dissolved in 14.8
mL of CH2Cl2, cooled to 4°C, and 2.83 mmol of iodoacetic
anhydride in 11.3 mL of CH2Cl2 was added dropwise to the
stirred solution. The solution was warmed to room
temperature and held for 2 h. The reaction mixture was
diluted to about 50 mL with CH2Cl2 and washed 3 times
(25 mL) with saturated sodium bicarbonate solution and
then 2 times with water. The volume of the solution was
reduced to about 5 mL under vacuum and passed through 25
mL of silica in diethyl ether. The eluate was reduced
to an off-white powder under vacuum. This yielded 820
mg (2.03 mmol) of the N-hexadecyliodoacetamide (71.8%
yield). The 300 MHz 1H NMR spectrum of the product was
consistent with the expected structure.

Synthesis of N-Hexadecylacetamido-S-ACP

E. coli ACP prepared as above (10 mg in 2 mL of
50 mM Tris-Cl, pH 7.6) was treated at 37°C with 50 mM
DTT for 2 h. The solution was made to 10%
trichloroacetic acid (TCA), held at 0°C for 20 min and
centrifuged to pellet. The resultant pellet was washed
(2 x 2 mL) with 0.1 M citrate, pH 4.2 and redissolved in
3 mL of 50 mM potassium phosphate buffer. The pH of the
ACP solution was adjusted to 7.5 with 1 M KOH and 3 mL
of N-hexadecyliodoacetamide (3 mM in 2-propanol) was
added. A slight precipitate of the
N-hexadecyliodoacetamide was redissolved by warming the reaction mix to
45°C. The mixture was held at 45°C for 6 h. SDS-PAGE on
20% acrylamide PAGE gel showed approximately 80%
conversion to an ACP species of intermediate mobility
between the starting, reduced ACP and authentic
palmitoyl-ACP. Excess N-hexadecyliodoacetamide was
removed from the reaction mix by 4 extractions (3 mL)
with CH2Cl2 with gentle mixing to avoid precipitation of
the protein at the interface.

Coupling of N-Hexadecylacetamido-S-ACP
to CNBr-activated Sepharose 4B

Cyanogen bromide-activated Sepharose 4B (Pharmacia,
2 g) was suspended in 1 mM HCl and extensively washed by
filtration and resuspension in 1 mM HCl and finally one
wash in 0.1 M NaHCO3, pH 8.3. The
N-hexadecylacetamido-S-ACP prepared above was diluted with an equal
volume of 0.2 M NaHCO3, pH 8.3. The filtered cyanogen
bromide-activated Sepharose 4B (about 5 mL) was added to
the N-hexadecylacetamido-S-ACP solution, the mixture was
made to a volume of 10 mL with the 0.1 M NaHCO3, pH 8.3,
and mixed by tumbling at room temperature for 6 h.
Protein remaining in solution (Bradford assay) indicated
approximately 85% binding. The gel suspension was
collected by centrifugation, washed once with the 0.1 M
NaHCO3, pH 8.3, and resuspended in 0.1 M ethanolamine
adjusted to pH 8.5 with HCl. The suspension was allowed
to stand at 4°C overnight and then washed by
centrifugation and re-suspension in 12 mL of 0.1 M
acetate, pH 4.0, 0.5 M in NaCl and then 0.1 M NaHCO3, pH
8.3, 0.5 M in NaCl. The alkyl-ACP Sepharose 4B was
packed into a 1 x 5.5 cm column and washed extensively
with 20 mM bis-tris propane-Cl (BTP-Cl), pH 7.2, before
use.

ACYL-ACP THIOESTERASE ASSAY

Acyl-ACP thioesterase was assayed as described by
McKeon et al. (J. Biol. Chem. (1982) 257:12141-12147).
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Each of the radiolabeled acyl-ACP's were adjusted to 
concentrations ranging from 0.05 µM to 1.5 µM and a
volume of 25 µL with a reaction buffer consisting of
1 mg/mL bovine serum albumin in 0.1 M Tricine buffer at
pH 8.2. Reactions were started with 5 µL of soybean
seed extract containing acyl-ACP thioesterase activity
and incubated for times varying from 12 sec to 5 min
depending upon the activity of the fraction. Reactions
were terminated by the addition of 100 µL of a solution
of 5% acetic acid in 2-propanol and extracted twice with
1 mL each of water saturated hexane. Five mL of
ScintiVerse Bio HP (Fisher) scintillation fluid was
added to the combined extracts and radioactivity in the
released fatty acids was determined by scintillation
counting.

For routine assays during acyl-ACP thioesterase
purification [14C]stearoyl-ACP at a concentration of 0.6
µM was used in the assay as described above.



PURIFICATION OF SOYBEAN ACYL-ACP THIOESTERASE

Developing soybean seeds (Glycine max cultivar
Wye), ca. 20-25 days after flowering, were harvested and
stored at -80° until use. One kg of the seeds was
added while frozen to 2 L of a buffer consisting of 50
mM TRIS/HCl pH 8.0, 2 mM DTT and 0.2 mM EDTA in a Waring
blendor and ground until thawed and homogenized. The
homogenate was centrifuged at 14,000xg for 20 min,
decanted and the supernatant was centrifuged at 35,000xg
for 45 min. The resulting high speed supernatant was
adjusted to 55% saturation with ammonium sulfate at 4°
and protein was allowed to precipitate for 30 min before
centrifugation at 14,000xg for 15 min to remove
precipitated proteins. The precipitate was dissolved
in 50 mM BTP-HCl buffer, pH 7.2, 1 mM in
2-mercaptoethanol and dialyzed overnight against 15 L of
the same buffer at 5 mM. The dialyzed ammonium sulfate
fraction was adjusted to a buffer concentration of 20
mM and a protein concentration of 5 mg/mL, and Triton
X-100 was added to a final concentration of 0.02%. One
third of the resulting solution was applied to a 250 mL
column of Blue sepharose contained in a radial flow
column. The flow rate was approximately 75 mL/min and
the column was washed with the application buffer until
the absorbance at 280 nm monitored at the column efflux
returned to zero after application of the protein.
Acyl-ACP thioesterase activity was eluted with 1 M NaCl
in the same buffer and the column was washed with an
additional three column volumes of the salt containing
buffer before re-equilibration with six column volumes
of the starting buffer. This procedure was repeated
twice more to bind and elute all of the acyl-ACP
thioesterase activity present in the 55% ammonium
sulfate fraction.

The combined Blue sepharose eluates were brought to
85% saturation in ammonium sulfate at 4°, allowed to
precipitate for 30 min, then centrifuged at 20,000xg
for 20 min. The resulting pellet was redissolved in
20 mM TRIS-HCl, pH 7.4, 0.2 mM in EDTA and 1 mM in DTT
then dialyzed overnight against 4 L of the same buffer.
The dialysate was centrifuged at 22,000xg for 20 min
then applied at a flow rate of 5 mL/min to a Mono Q HR
16/10 anion exchange column (Pharmacia) equilibrated in
the same buffer. After application of the protein, the
column was washed with the same buffer until the
absorbance at 280 nm monitored at the column efflux
returned to near zero. The loaded column was
re-equilibrated to pH 8.5 with 20 mM TRIS-HCl, and after
the pH monitored at the column efflux was stable at that
pH, elution was begun with the following program: NaCl
concentration in the TRIS buffer system was increased
linearly from 0 to 0.1 M over 10 min, then held at 0.1 M
for 10 min. The NaCl concentration was then increased
linearly from 0.1 M to 0.3 M over 80 min. The acyl-ACP
thioesterase activity eluted broadly from an NaCl
concentration of 0.165 M to 0.275 M. Active fractions
were pooled, precipitated with ammonium sulfate as after
Blue sepharose elution, redissolved in 20 mM BTP-HCl at
pH 7.2 and dialyzed overnight against 2 L of the same
buffer at 5 mM. After dialysis, the Mono Q fraction was
adjusted to 20 mM BTP-HCl and 0.02% Triton X-100 before
application to the alkyl-ACP affinity column. The
column was loaded at 1 mL/min, then washed with the
application buffer until the absorbance at 280 nm
monitored at the column efflux returned to zero. The
column was then washed with 0.1 M NaCl in the same
buffer until a protein peak was washed from the column
and the column efflux 280 nm absorbance returned to zero
before elution of the acyl-ACP thioesterase activity
with 1 M NaCl in the BTP-HCl buffer system.
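
The Mono Q salt program described above (ramp, hold, ramp) can be
written as a piecewise-linear function of time. The sketch below is
illustrative only: the function names are hypothetical, time zero is
taken as the start of the elution program, and any delay between the
gradient mixer and the column outlet is ignored, which is an
assumption not stated in the text.

```python
def nacl_molarity(t_min: float) -> float:
    """NaCl concentration (M) delivered at t_min minutes into the program.

    Program from the text: 0 -> 0.1 M over 10 min, hold at 0.1 M for
    10 min, then 0.1 -> 0.3 M over 80 min.
    """
    if t_min <= 0:
        return 0.0
    if t_min <= 10:
        return 0.1 * t_min / 10          # first linear ramp
    if t_min <= 20:
        return 0.1                       # hold
    if t_min <= 100:
        return 0.1 + 0.2 * (t_min - 20) / 80   # second linear ramp
    return 0.3


def time_at_molarity(conc_m: float) -> float:
    """Minutes into the program at which conc_m is reached on the second ramp."""
    if not 0.1 <= conc_m <= 0.3:
        raise ValueError("concentration is outside the 0.1-0.3 M ramp")
    return 20 + (conc_m - 0.1) / 0.2 * 80


# Window over which activity eluted (0.165 M to 0.275 M NaCl).
elution_start = time_at_molarity(0.165)
elution_end = time_at_molarity(0.275)
print(f"activity elutes from ~{elution_start:.0f} to ~{elution_end:.0f} min")
```

Under these assumptions the broad activity peak spans roughly 46 to
90 min of the program, that is, most of the second ramp.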

The eluant from the alkyl-ACP column was made to
1 M in ammonium sulfate and applied at a flow rate of
0.5 mL/min to a Phenyl Superose HR 5/5 column
(Pharmacia) which was equilibrated with 1 M ammonium
sulfate in 50 mM potassium phosphate buffer at pH 7.0.
After sample application, the column was washed with
equilibration buffer until the absorbance at 280 nm
returned to zero and then eluted with a 20 mL gradient
from 1 M ammonium sulfate in the potassium phosphate
buffer to the buffer alone.

Acyl-ACP thioesterase containing fractions from the
Phenyl Superose column contained from 400 to 600 µg of
protein and were enriched in specific activity of the
acyl-ACP thioesterase by from 8,500 to 10,500 fold
depending upon the preparation. Gel filtration
chromatography of the Phenyl Superose purified acyl-ACP
thioesterase on an UltroPac TSK G200 SW (0.75x60 cm,
Pharmacia) eluted with 0.1 M potassium phosphate buffer
at 1 mL/min gave one major protein peak which also
corresponded with the acyl-ACP thioesterase activity.
The molecular size estimation of that peak was
approximately 75 kD. Analysis of the peptides present
in the gel filtration purified acyl-ACP thioesterase
showed three peptides of 39, 34 and 33 kD in size. The
peptide at 39 kD was always least abundant and was not
clearly visible in some preparations. Of the 34 and 33
kD peptides, the abundance of the 34 kD peptide slightly
exceeds that of the 33 kD peptide. Further separation
of these three peptides with retention of thioesterase
activity has not been possible.

Antibody Precipitation of Soybean Seed Acyl-ACP
Thioesterase
Acyl-ACP thioesterase purified through the Phenyl
Superose step was denatured with DTT and SDS, applied to
a gradient polyacrylamide gel (9 to 15% acrylamide) and
subjected to SDS electrophoresis. The developed gel was
stained with a 9:1 mixture of 0.1% Coomassie blue in 50%
methanol to 0.5% Serva blue in 50% methanol then
partially destained with 3% glycerol in 20% methanol.
The peptide doublet at 33 and 34 kD was cut from the
gel, frozen in liquid nitrogen, then ground to a powder
and suspended in 50 mM sodium phosphate buffer. The
suspended gel with protein was sent for antibody
production in New Zealand White rabbit by Hazelton
Research Products Inc., Denver, PA. Serum obtained after
three injections of the combined 33 and 34 kD peptides
identified those peptides in Western analysis, but also
cross-reacted with the much less abundant peptide at
39 kD which was not included in the antigen
preparations. The anti-33 and 34 kD serum was purified
by immune affinity chromatography. Approximately one mg
of acyl-ACP thioesterase purified through the Phenyl
Superose step of the purification sequence described
above was bound to CNBr activated sepharose (Pharmacia)
according to the manufacturer's instructions. Five mL
of the antiserum was equilibrated in 10 mM potassium
phosphate buffer (pH 7.4) by gel filtration, mixed with
the antigen bound to sepharose and allowed to bind
overnight at 4°. The sepharose was poured into a small
column and washed with 5 column volumes of the phosphate
buffer then eluted with 0.1 M glycine (pH 2.5).
Fractions of 0.9 mL were collected in tubes containing
0.05 mL of 2 M TRIS and 1 mg of bovine serum albumin.
Fractions containing the anti-33 and 34 kD peptide
immunoglobulin were identified by using each fraction as
the antibody in Western analysis. Active fractions were
pooled and concentrated to approximately 50 µL by
membrane concentration.

Soybean seed acyl-ACP thioesterase was purified
through the Mono-Q anion exchange step described in the
scheme above. Fold purification over the starting
extract was about 60. Ten µL of this preparation was
added to 2 µL of 0.1 M TRIS/glycine buffer (pH 8.0)
which contained from 0 to 2 µL of the purified antibody
preparation. The solution was incubated for 45 min at
room temperature, then 20 µL of Protein A-sepharose
(Sigma) was added and the mixture was incubated an
additional 30 min. The Protein A-sepharose was removed
by centrifugation and 3 µL of the supernatant was taken
for the standard acyl-ACP thioesterase assay.
Pre-immune serum from the rabbit was diluted 1 to 10 in
the incubation mix with the acyl-ACP thioesterase
preparation, incubated and treated with Protein
A-sepharose as above for a control. Net activity of the
acyl-ACP thioesterase preparation after treatment with
various dilutions of the antibody is shown below:

TABLE 2

Dilution of antibody      Net pmol/µL/min

1 to 1000                 3.46
1 to 500                  3.60
1 to 100                  3.85
1 to 50                   2.24
1 to 25                   0.90
1 to 16.6                 0.29
1 to 12.5                 0.26
1 to 10                   0.30
1 to 5                    0
Pre-immune 1 to 10        3.46
No antibody               4.07



The acyl-ACP thioesterase can thus be precipitated
by the anti-33 and 34 kD antiserum, indicating that
these two peptides are either all or part of the soybean
seed thioesterase enzyme.
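
To make the titration in Table 2 easier to read, each net activity
can be expressed as a percentage of the no-antibody control. The
snippet below is a minimal sketch of that arithmetic; the values are
copied from Table 2, while the variable and function names are
hypothetical and the choice of the no-antibody row as the 100%
reference is an assumption made for illustration.

```python
# Net activities (pmol/uL/min) copied from Table 2.
NO_ANTIBODY = 4.07
TABLE_2 = {
    "1 to 1000": 3.46,
    "1 to 500": 3.60,
    "1 to 100": 3.85,
    "1 to 50": 2.24,
    "1 to 25": 0.90,
    "1 to 16.6": 0.29,
    "1 to 12.5": 0.26,
    "1 to 10": 0.30,
    "1 to 5": 0.0,
    "Pre-immune 1 to 10": 3.46,
}


def percent_remaining(net_activity: float, reference: float = NO_ANTIBODY) -> float:
    """Activity left in the supernatant, as a percentage of the reference."""
    return 100.0 * net_activity / reference


for dilution, activity in TABLE_2.items():
    print(f"{dilution:>20}: {percent_remaining(activity):5.1f}% of control")
```

Read this way, the pre-immune control at 1 to 10 retains about 85% of
the activity, while immune antibody at the same dilution leaves less
than 10%.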

N-Terminal and Internal Amino Acid Sequence from the
Acyl-ACP Thioesterase

Acyl-ACP thioesterase purified through the Phenyl
Superose step of the standard scheme was purified by
reversed-phase chromatography to remove the small amount
of the 39 kD peptide and a trace of lower molecular
weight contaminant. One hundred µg of the preparation
in 1 mL total volume was made to 0.1% trifluoroacetic
acid (TFA) and loaded at 0.2 mL/min onto a Vydac
diphenyl reversed phase column. The column was washed
for 20 min with 0.1% TFA, then eluted by stepping to 25%
acetonitrile in 0.1% TFA, washing for 8 min then eluting
with a gradient from 25 to 70% acetonitrile in 0.1% TFA.
The 33 and 34 kD peptides eluted together at 35.5%
acetonitrile.

The combined peptides in the reverse phase purified
fraction were used to determine the N-terminal amino
acid sequence on an Applied Biosystems 470A Gas Phase
Sequencer. PTH amino acids were analyzed on an Applied
Biosystems 120 PTH Amino Acid Analyzer. The N-terminal
sequence was determined to be:

R-V-E-A-P-G-G-T-L-A-D-R-L (SEQ ID NO:5).

Other residues were present in most cycles, most
notably the P in cycle 5 and the G in cycle 6.

Internal fragments of the combined peptides were
also generated by CNBr cleavage. Acyl-ACP thioesterase
purified through the Phenyl Superose step in the
purification scheme (400 µg in 290 µL) was denatured by
the addition of 24 µL of *M TRIS at pH 8.0, 15 mg of
DTT, 31 µL of 0.5 M EDTA and solid guanidine-HCl to make
the solution 6 M in guanidine. The solution was
incubated at room temperature for 2.5 h before the
addition of 33 µL of 4-vinyl-pyridine and then incubated
an additional 4 h. The solution was desalted by
dilution to 2.5 mL and passage through a sephadex G-25
column which had been equilibrated in 2 mM TRIS, pH 8.0.
The solution was lyophilized, redissolved in 400 µL of
70% TFA, placed in a sealable flask then evacuated and
flushed with N2. CNBr (2 mg in 2 µL of 70% TFA) was
added and the flask was again evacuated and flushed with
N2. After incubation for 20 h in the dark at room
temperature, the reaction mixture was diluted to 4 mL
with water and again lyophilized. The residue was
dissolved in water and approximately 200 µg (on the
basis of the starting protein) was precipitated with 10%
trichloroacetic acid (TCA). The resulting pellet was
removed by centrifugation, then washed in sequence with
acetone, 1% TCA and acetone again. The washed pellet
was dissolved in 100 µL of 1% SDS with 7% glycerol and
loaded onto a 20% crosslinked polyacrylamide gel for
electrophoresis. The developed gel was
electrophoretically blotted onto Immobilon membrane
(Millipore), stained with 0.5% coomassie blue in 50%
methanol and destained with 50% methanol. Three
prominent bands of about 28 kD, 16 kD and 14 kD were cut
from the Immobilon, and the N-terminal sequence of each
was determined by gas phase sequencing as described
above. With the exceptions of the 5th, 6th and 8th
cycles, the sequence of the 28 kD fragment was identical
to the N-terminal of the non-CNBr treated protein
although other residues were present in all cycles.
Nine cycles of sequence were obtained from the 16 kD
band and 16 from the 14 kD band. The first nine cycles
were identical for the two peptides, and the common
sequence obtained for the fragments is as follows:



I-E-I-Y-K-Y-P-A-W-L-D-I-V-E-I (SEQ ID NO:6).

Cloning of Soybean Seed Acyl-ACP Thioesterase cDNA

Based on the N-terminal sequence from cycle 2
through 11, a set of 64 degenerate 29 nucleotide-long
probes was designed for use as a hybridization probe:

(SEQ ID NO:7)
PROTEIN SEQUENCE: V   E   A   P   G   G   T   L   A   D

(SEQ ID NO:8)
DNA SEQUENCE: 5'-GTT GAA GCN CCA GGA GGN ACN TTT GCA GA
                   G   G       T   T         C G   T

(SEQ ID NO:9)
PROBE:        5'-GTT GAA GCI CCA GGI GGI ACI TTT GCA GA
                   G   G       T             C G   T

The design took into account the codon bias in
representative soybean seed genes encoding Bowman-Birk
protease inhibitor (Hammond et al., J. Biol. Chem.
(1984) 259:9883-9890), glycinin subunit A-2B-1a (Utsumi
et al., Agric. Biol. Chem. (1987) 51:3267-3273), lectin
(le-1) (Vodkin et al., Cell (1983) 34:1023-1031), and
lipoxygenase-1 (Shibata et al., J. Biol. Chem. (1987)
262:10080-10085). Four deoxyinosines were used at
selected positions of ambiguity.
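
The stated pool size of 64 can be checked by counting the two-fold
ambiguous positions in the probe, since each one doubles the number
of distinct oligonucleotides while a deoxyinosine (I) adds none. The
sketch below is illustrative only. The bracket notation and names are
hypothetical, and the placement of the six alternate bases (at codon
wobble positions, plus the first base of the leucine codon) is an
assumption inferred from the genetic code, because the column
alignment of the alternates is not preserved in the text.

```python
import math
import re
from itertools import product

# Probe of SEQ ID NO:9 with assumed placement of the alternate bases.
# [XY] marks a position synthesized as a mixture of X and Y; I is deoxyinosine.
PROBE = "GT[TG] GA[AG] GCI CC[AT] GGI GGI ACI [TC]T[TG] GC[AT] GA"


def parse_positions(notation: str) -> list:
    """Return one string of allowed bases per nucleotide position."""
    tokens = re.findall(r"\[([A-Z]+)\]|([A-Z])", notation.replace(" ", ""))
    return [mixed or single for mixed, single in tokens]


positions = parse_positions(PROBE)
degeneracy = math.prod(len(p) for p in positions)   # number of distinct oligos
pool = ["".join(bases) for bases in product(*positions)]

print(f"length: {len(positions)} nt")
print(f"deoxyinosines: {positions.count('I')}")
print(f"degeneracy: {degeneracy}")
```

With this placement the probe is 29 nucleotides long, carries four
deoxyinosines and expands to 64 sequences, which agrees with the
figures given in the text.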

A cDNA library was made as follows: Soybean
embryos (ca. 50 mg fresh weight each) were removed from
the pods and frozen in liquid nitrogen. The frozen
embryos were ground to a fine powder in the presence of
liquid nitrogen and then extracted by Polytron
homogenization and fractionated to enrich for total RNA
by the method of Chirgwin et al. (Biochemistry, (1979)
18:5294-5299). The nucleic acid fraction was enriched
for polyA+ RNA by passing total RNA through an oligo-dT
cellulose column and eluting the polyA+ RNA by salt as
described by Goodman et al. (Meth. Enzymol. (1979)
68:75-90). cDNA was synthesized from the purified
polyA+ RNA using cDNA Synthesis System (Bethesda
Research Laboratory) and the manufacturer's
instructions. The resultant double-stranded DNA was
methylated by DNA methylase (Promega) prior to
filling-in its ends with T4 DNA polymerase (Bethesda
Research Laboratory) and blunt-end ligating to
phosphorylated EcoRI linkers using T4 DNA ligase
(Pharmacia). The double-stranded DNA was digested with
EcoRI enzyme, separated from excess linkers by passing
through a gel filtration column (Sepharose CL-4B), and
ligated to lambda ZAP vector (Stratagene) as per
manufacturer's instructions. Ligated DNA was packaged
into phage using Gigapack packaging extract (Stratagene)
according to manufacturer's instructions. The resultant
cDNA library was amplified as per Stratagene's
instructions and stored at -80°C.

Following the instructions in lambda ZAP Cloning
Kit Manual (Stratagene), the cDNA phage library was used
to infect E. coli BB4 cells and plated to yield ca.
35,000 plaques per petri plate (150 mm diameter).
Duplicate lifts of the plates were made onto
nitrocellulose filters (Schleicher & Schuell).
Duplicate lifts from five plates were prehybridized in
25 mL of Hybridization buffer consisting of 6X SSC
(0.9 M NaCl, 0.09 M sodium citrate, pH 7.0), 5X
Denhardt's [0.5 g Ficoll (Type 400, Pharmacia), 0.5 g
polyvinylpyrrolidone, 0.5 g bovine serum albumin
(Fraction V; Sigma)], 1 mM EDTA, 1% SDS, and 100 µg/mL
denatured salmon sperm DNA (Sigma Chemical Co.) at 45°C
for 10 h. Fifty pmol of the hybridization probe (see
above) were end-labeled in a 52.5 µL reaction mixture
containing 50 mM Tris-Cl, pH 7.5, 10 mM MgCl2, 0.1 mM
spermidine-HCl (pH 7.0), 1 mM EDTA (pH 7.0), 5 mM DTT,
200 µCi (66.7 pmol) of gamma-labeled AT32P (New England
Nuclear) and 25 units of T4 polynucleotide kinase (New
England Biolabs). After incubation at 37°C for 45 min,
the reaction was terminated by heating at 68°C for 10
min. Labeled probe was separated from unincorporated
AT32P by passing the reaction through a Quick-Spin™
(G-25 Sephadex®) column (Boehringer Mannheim
Biochemicals). The purified labeled probe (1.2 x 10^7
dpm/pmol) was added to the prehybridized filters,
following their transfer to 10 mL of fresh Hybridization
buffer. Following incubation of the filters in the
presence of the probe for 48 h in a shaker at 48°C, the
filters were washed in 200 mL of Wash buffer (6X SSC,
0.1% SDS) five times for 5 min each at room temperature,
then at 48°C for 5 min and finally at 62°C for 5 min.
The washed filters were air dried and subjected to
autoradiography on Kodak XAR-2 film in the presence of
intensifying screens (Lightening Plus, DuPont Cronex®)
at -80°C overnight. Six positively-hybridizing plaques
were subjected to plaque purification as described in
Sambrook et al. (Molecular Cloning, A Laboratory Manual,
2nd ed. (1989), Cold Spring Harbor Laboratory Press).
None of the potential positives purified to completion
and all were eventually dropped as false positives.
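
The hybridization and wash buffers are specified as multiples of SSC
(6X here, with 0.6X and 0.2X washes used in a later example). The
text defines 6X SSC as 0.9 M NaCl and 0.09 M sodium citrate, which
corresponds to 1X SSC at 0.15 M NaCl and 0.015 M sodium citrate. The
helper below is a minimal sketch of that scaling; the function names
and the 20X stock strength are hypothetical assumptions, not taken
from the text.

```python
# 1X SSC composition implied by the text's definition of 6X SSC.
NACL_1X_M = 0.15
CITRATE_1X_M = 0.015


def ssc_molarity(strength: float):
    """Return (NaCl M, sodium citrate M) for an n-fold SSC solution."""
    return round(NACL_1X_M * strength, 4), round(CITRATE_1X_M * strength, 4)


def stock_volume_ml(final_strength: float, final_volume_ml: float,
                    stock_strength: float = 20.0) -> float:
    """Volume of concentrated SSC stock needed (assumes a 20X stock)."""
    return final_volume_ml * final_strength / stock_strength


for strength in (6, 0.6, 0.2):
    nacl, citrate = ssc_molarity(strength)
    print(f"{strength}X SSC: {nacl} M NaCl, {citrate} M sodium citrate")

# Stock needed for one 200 mL wash in 6X SSC, under the 20X-stock assumption.
print(f"{stock_volume_ml(6, 200):.0f} mL of 20X stock per 200 mL wash")
```

Lower SSC multiples mean lower salt and therefore a more stringent
wash, which is why the later screen with a long homologous probe uses
0.6X and 0.2X SSC rather than 6X.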

A second oligonucleotide probe was constructed
based on the amino acid sequence derived from the CNBr
fragments at 14 and 16 kD as follows:

(SEQ ID NO:10)
PROTEIN SEQUENCE: I E I Y K Y P A W L D I E I

(SEQ ID NO:11)
DNA SEQUENCE: 5'-ATN GAA ATN TAC AAA TAC CCN GCN TGG CTN GAC ATN GAA ATN
G T G T T TT G

(SEQ ID NO:12)
PROBE: ATI GAA ATI TAT AAA TAT CCI GCI TGG TTI GAT ATI GAA AT
G G G

The design is based on the same codon bias
assumptions as the N-terminal probe described above,
with the additional simplification of eliminating the C
at all G/C ambiguities. Probe radiolabeling was done as
described for the N-terminal probe and hybridization of
nitrocellulose lifts was done similarly, except that the
hybridization temperature was lowered to 37°. Screening
of five plates with approximately 33,000 plaques each
produced five positives which were then plaque purified.
Of the five positives, four purified and isolated
plaques could be taken corresponding to radioactive
signals on the lifts in the second round of
purification.
Following the Lambda ZAP Cloning Kit Instruction
Manual (Stratagene), sequences of the pBluescript
vector, including the cDNA inserts, from each of four
purified phages were excised in the presence of a helper
phage and the resultant phagemids were used to infect
E. coli XL-1 Blue cells resulting in double-stranded
plasmids, p22A, p22B, p25A and p25B. Purity of the
clones was checked by colony hybridization and a single,
positive colony from each was used for culture
preparation.

DNA from the plasmids was made by the alkaline
lysis miniprep procedure described in Sambrook et al.
(Molecular Cloning, A Laboratory Manual, 2nd Ed. (1989)
Cold Spring Harbor Laboratory Press). The
alkali-denatured double-stranded DNA from p22B was
sequenced using Sequenase T7 DNA polymerase (US
Biochemical Corp.) and the manufacturer's instructions.
The sequence of the cDNA insert in plasmid p22B is shown
in SEQ ID NO:1.

EXAMPLE 2
EXPRESSION OF SOYBEAN SEED
ACYL-ACP THIOESTERASE IN E. COLI

Construction of β-Galactosidase-Acyl-ACP Thioesterase
Fusion Protein

Sequences which are inserted into pBluescript
directionally correct and in-frame with the start
methionine of the interrupted β-galactosidase gene borne
on the plasmid are capable of being expressed as fusion
proteins consisting of the N-terminal sixteen amino
acids of β-galactosidase plus those encoded by the
inserted sequence. Sequencing of p22B revealed that the
cDNA insert of that plasmid was directionally correct
but 1 base out of frame. Two µg of p22B was digested
for 2 h with 30 units of BamHI. This cleaves once in
the polylinker site of the Bluescript portion of the
plasmid and once at a BamHI site beginning at base 282
of the insert in p22B. The complete digestion gave two
fragments, one of 301 bases from the 5' end of the cDNA
insert and a portion of the polylinker region of
Bluescript, and a 4.2 kB fragment composed of Bluescript
and the 3' 1320 bases of the cDNA insert. The 4.2 kB
fragment was purified by electrophoretic separation on a
6% polyacrylamide gel run in TRIS/borate/EDTA buffer.

The fragment was visualized by ethidium bromide
staining, cut from the gel, eluted into TRIS/EDTA buffer
overnight at 37° and precipitated by the addition of
sodium acetate to 0.3 M and ethanol to 50%. The two
half-BamHI sites on the purified fragment were
re-ligated by incubation of 50 ng of the fragment in a
25 µL reaction with eight units of T4 DNA ligase
overnight at 16°C. Competent E. coli XL-1 blue cells
(Stratagene) were transformed with 30 ng of the ligated
plasmid. Transformants were picked as
ampicillin-resistant cells after overnight growth.
Eight colonies were chosen and mini-preparations of
plasmid DNA were made by the alkaline lysis procedure
described above. Agarose gel electrophoresis of the
uncut plasmids next to supercoiled weight standards
showed that all eight plasmids were approximately 4.2 kB
in size. The eight transformed cell lines containing
plasmids designated p22Ba through p22Bh along with
untransformed XL-1 blue cells and the transformed line
carrying p22B were grown overnight in 5 mL of TB media
with 0.2% glucose. The overnight cultures were diluted
1:1 into fresh TB + glucose media which also contained
10 mM isopropyl thiogalactoside and growth was continued
for 1.5 h at 37°C. Cells were harvested by
centrifugation and re-suspended in 1 mL of 50 mM TRIS,
pH 8.0. A subsample containing 10 µg protein was taken
and added to 20 µL of SDS sample buffer for analysis by
SDS-PAGE and western blotting. The remaining sample was
made to 10 mM with DTT, 0.2 mM with PMSF and broken by
probe sonication. Cell debris was removed by
centrifugation and 5 µL of the extract was used in the
standard acyl-ACP thioesterase assay using stearoyl-ACP
as the substrate. The results are shown in Table 3.



TABLE 3

Extract        Net reaction

XL-1 blue      0.42
p22B           0.58
p22Ba          2.17
p22Bb          2.05
p22Bc          2.25
p22Bd          2.17
p22Be          2.11
p22Bf          2.25
p22Bg          1.84
p22Bh          1.71



While p22B does not have activity significantly
greater than the endogenous E. coli activity,
thioesterase activity was greatly increased by the
combination of removing the transit peptide and placing
the construction in-frame relative to the fusion protein
start methionine. Western analysis of the proteins
produced by each of the cell lines also showed a single,
antibody-positive signal of about 42 kD in size produced
by each of the in-frame plasmids, but no signal produced
by plasmid p22B.

Plasmid p22Ba was chosen for more detailed analysis 
using both palmitoyl and oleoyl-ACP as substrates. 
Cells containing p22B were used as the controls 
indicative of the endogenous E. coli thioesterase. When
palmitoyl-ACP was used as substrate, p22B cell extract 
showed a low but measurable reaction rate while that of 
the p22Ba-containing cells was ten fold higher. When 
oleoyl-ACP was used as substrate, the rate of acyl-ACP 
hydrolysis by extract from the p22Ba-containing cells 
was 96 fold greater than that of the controls. 

EXAMPLE 3 

USE OF SOYBEAN SEED ACYL-ACP 
THIOESTERASE SEQUENCE IN PLASMID AS A 
RESTRICTION FRAGMENT LENGTH POLYMORPHISM (RFLP) MARKER
The cDNA insert from plasmid p22B was removed from
the Bluescript vector by digestion with restriction
enzyme EcoRI in standard conditions as described in
Sambrook et al. (Molecular Cloning, A Laboratory Manual,
2nd ed. (1989) Cold Spring Harbor Laboratory Press) and
labeled with 32P using a Random Priming Kit from
Bethesda Research Laboratories under conditions
recommended by the manufacturer. The resulting
radioactive probe was used to probe a Southern blot
(Sambrook et al., Molecular Cloning, A Laboratory
Manual, 2nd ed. (1989) Cold Spring Harbor Laboratory
Press) containing genomic DNA from soybean (Glycine max
(cultivar Bonus) and Glycine soja (PI81762)), digested
with one of several restriction enzymes. After
hybridization and washes under standard conditions
(Sambrook et al., Molecular Cloning, A Laboratory
Manual, 2nd ed. (1989) Cold Spring Harbor Laboratory
Press), autoradiograms were obtained and different
patterns of hybridization (polymorphisms) were
identified in digests performed with restriction enzymes
PstI and EcoRI. The same probe was then used to map
the polymorphic p22B loci on the soybean genome,
essentially as described by Helentjaris et al. (Theor.
Appl. Genet. (1986) 72:761-769). Plasmid pDS1 probe was
applied, as described above, to Southern blots of EcoRI,
PstI, EcoRV, BamHI, or HindIII digested genomic DNAs
isolated from 68 F2 progeny plants resulting from a
G. max Bonus x G. soja PI81762 cross. The bands on the
autoradiograms were interpreted as resulting from the
inheritance of either paternal (Bonus) or maternal
(PI81762) pattern, or both (a heterozygote). The
resulting data were subjected to genetic analysis using
the computer program Mapmaker (Lander et al., Genomics
(1987) 1:174-181). In conjunction with previously
obtained data for 436 anonymous RFLP markers in soybean
(S. Tingey et al., J. Cell. Biochem., Supplement 14E
(1990) p. 291, abstract R153), we were able to position
one genetic locus corresponding to the p22B probe on the
soybean genetic map. This information will be useful in
breeding soybean lines with altered saturate levels.

EXAMPLE 4 

USE OF SOYBEAN SEED ACYL-ACP THIOESTERASE SEQUENCE IN
PLASMID p22B AS A PROBE FOR ADDITIONAL SOYBEAN ACYL-ACP
THIOESTERASE GENES

The cDNA insert in plasmid p22B was removed by
digestion with EcoRI and purified by electrophoretic
separation on 6% polyacrylamide. The 1.6 kB fragment
was localized by ethidium bromide staining, eluted from
the gel and precipitated from 0.3 M sodium acetate with
50% ethanol. Thirty ng of the resulting DNA fragment
was used as the template in a random primer labeling
reaction using a labeling kit from Bethesda Research
Laboratories. The early development soybean seed cDNA
library described in Example 1 was re-plated at a plaque
density of 35,000 per plate and duplicate nitrocellulose
lifts from four plates were screened. The
pre-hybridization and hybridization buffer was that
described in Example 1, but the probe annealing
conditions were 50° for 40 h. The filter lifts were
washed 3 times at room temperature with 0.6 x SSC
containing 0.1% SDS, then once at 50°C for 5 min in the
same solution. Two additional washes were given for 5
min each at 50°C in 0.2 x SSC, 0.1% SDS followed by a 1
min rinse under the same conditions.

After autoradiography for 20 h, ten hybridizing
plaques were identified. These were plaque purified and
excised into Bluescript plasmids as described in Example
1. To check for the similarity of the cDNA inserts in
these plasmids to the sequence of soybean seed acyl-ACP
thioesterase copy 1 shown in SEQ ID NO:1, a 30 base
oligonucleotide was prepared for use as the extension
primer in dideoxy sequencing reactions. The primer
corresponded to bases 1028 to 1058 in the sequence of
SEQ ID NO:13.

The placement of the primer oligonucleotide on
cDNA's similar to that found in p22B should allow
sequencing the 3' 100 bases of the open reading frame
and 100 to 170 bases of the 3' untranslated region.
Bluescript plasmids purified from six of the ten
positively hybridizing clones described above were
sequenced. Of these, one did not give a sequencing
reaction with the primer. Sequencing from the universal
and T3 primers of the Bluescript plasmid revealed that
this clone was a partial cDNA, identical to the insert
in p22B, but terminating 3' to the primer region. The
sequences of the remaining five clones used as templates
showed two classes of sequence, one clone identical
through the region sequenced to the p22B and four
examples of a second acyl-ACP thioesterase gene with a
single base change in the portion of the open reading
frame which was sequenced (at base 1094 of SEQ ID NO:1,
C is changed to T) and decreased homology in the 3'
non-coding region.

Nucleotide 1 of SEQ ID NO:1 is the first nucleotide
of the EcoRI cut site reading from 5' to 3' on the cDNA
insert and nucleotide 1602 is the last nucleotide of the
cDNA insert in the EcoRI cut site of plasmid p22B which
encodes copy 1 of the soybean seed acyl-ACP
thioesterase. Nucleotides 106 to 108 are the putative
translation initiation codon, nucleotides 271 to 273 are
the codon for the N-terminal of the purified enzyme,
nucleotides 1207 to 1209 are the termination codon,
nucleotides 1 to 105 are the 5' untranslated sequence and
nucleotides 1210 to 1602 are the 3' untranslated
nucleotides.

Digestion of two of the plasmids (p4A and p4C) with
EcoRI followed by analysis on agarose gel
electrophoresis showed cDNA inserts of 1.0 and 1.4 kB
respectively. Dideoxy sequencing of both plasmids
showed them to be identical and the insert in p4C to be
a full length clone. By the very high degree of
homology between the open reading frames of p4C and
p22B, p4C can reasonably be expected to encode a second
acyl-ACP thioesterase. The base and amino acid
sequences of soybean seed acyl-ACP thioesterase (copy 2)
are shown in SEQ ID NO:3.

Nucleotide 1 of SEQ ID NO:3 is the first nucleotide
of the EcoRI cut site reading from 5' to 3' on the cDNA
insert and nucleotide 1476 is the last nucleotide of the
cDNA insert in the EcoRI cut site of plasmid p4C which
encodes copy 2 of the soybean seed acyl-ACP
thioesterase. The putative initiation codon is
nucleotides 117 to 119, the N-terminal of the mature
protein is nucleotides 282 to 284, and the termination
codon is nucleotides 1218 to 1220.
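The codon positions quoted for the two cDNAs can be checked for
reading-frame consistency with a short calculation. The sketch
below is illustrative only: it assumes the positions are 1-based
and inclusive as written, and the names FEATURES, orf_summary and
"leader" (the N-terminal extension absent from the purified
enzyme) are introduced here for illustration and do not appear in
the specification.

```python
# Coordinates are 1-based and inclusive, as quoted in the text.
FEATURES = {
    "copy 1 (SEQ ID NO:1)": {"start": 106, "mature": 271, "stop": 1207},
    "copy 2 (SEQ ID NO:3)": {"start": 117, "mature": 282, "stop": 1218},
}


def orf_summary(start, mature, stop):
    """Return residue counts implied by three codon start positions.

    start  -- first base of the putative initiation codon
    mature -- first base of the codon for the mature N-terminal residue
    stop   -- first base of the termination codon
    """
    coding = stop - start  # bases from the ATG up to, not including, the stop codon
    if coding % 3 or (mature - start) % 3:
        raise ValueError("feature positions are not in one reading frame")
    precursor = coding // 3
    leader = (mature - start) // 3
    return {"precursor": precursor, "leader": leader, "mature": precursor - leader}


SUMMARY = {name: orf_summary(**pos) for name, pos in FEATURES.items()}

if __name__ == "__main__":
    for name, counts in SUMMARY.items():
        print(name, counts)
```

Under these assumptions both copies give the same precursor,
leader and mature lengths, which agrees with the statement that
the two open reading frames are highly homologous.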

EXAMPLE 5

p22B AS A PROBE FOR ACYL-ACP THIOESTERASE GENES FROM
BRASSICA NAPUS

The 32P-labeled probe produced by random primed
labeling using the EcoRI fragment from p22B as described
in EXAMPLE 4 was used to screen a genomic library made
from Brassica napus cultivar Bridger DNA (Clontech
commercial library). The library was plated on two
plates at a density of approximately 60,000 plaques per
plate and duplicate nitrocellulose lifts were taken for
hybridization. The prehybridization and hybridization
buffer was that described in Example 1 with annealing of
the probe for 55 h at 42°C. The filter lifts were
washed twice at room temperature with 0.6x SSC
containing 0.1% SDS followed by two 5 min washes and one
1 min wash in the same solution, all at 52°C.

Hybridizing plaques were identified by
autoradiography for 18 h at -70°C. Of three positive
signals present on the duplicate plates, two were chosen
for plaque purification by removal from the plate,
dilution and re-screening under the above described
conditions. Single plaques from the two independent
clones (designated pCAN11 and pCAN21) were chosen, cored
to remove them from the plate, diluted and re-plated at
low titer for amplification. Ten plaques from each of
the clonal lines were selected, homogenized in buffer
and used to inoculate a 0.5 mL culture of E. coli strain
MN538 at a cell density of 0.5 OD600. The inoculum was
used to start a 100 mL culture in LB media and was grown
to cell lysis. Phage DNA was purified from the culture
as described in Sambrook et al. (Molecular Cloning, A
Laboratory Manual, 2nd ed. (1989) Cold Spring Harbor
Laboratory Press). DNA from the two clones was digested
with the following combinations of restriction
endonucleases: Sal I alone, Sal I+EcoRI, Sal I+Xba I,
Sal I+NotI, and Sal I+Bam HI. The digests were
subjected to electrophoresis on 1% agarose for blotting
to Hybond-N (Amersham). Southern analysis after
hybridization to the radiolabeled, random-primed probe
from p22B as described above revealed that all
hybridizing sequence from pCAN11 resided on a 3 kB Sal
I/Xba I fragment and that all hybridizing sequence from
pCAN21 resided on a 6 kB Sal I/EcoRI fragment. These
two fragments were again generated by digestion from the
corresponding clone, purified from the other fragments
by electrophoresis on 1% agarose, excised from the gel
after ethidium bromide staining, and removed from the
agarose by treatment with Gelase (Epicentre
Technologies), phenol extraction and ethanol
precipitation of the aqueous phase. The fragments were
ligated into the plasmid Bluescript SK+ (Stratagene)
which had been double digested with the corresponding
restriction endonucleases and used to transform
competent E. coli cells. Both the ligation and
transformation procedures were as described in Example 2
above. Three positives from pCAN21 and 5 positives from
pCAN11 were found and confirmed by purification of
plasmid DNA and digestion with the endonucleases used to
generate the ligated inserts.

The shorter, 3 kB clone was chosen for sequencing
by the dideoxy method as described in Example 1, above,
using the double-stranded Bluescript plasmid as the
template. The clone was partially sequenced into the
genomic insert from the M13 universal primer on
pBluescript and from two primers made corresponding to
segments of p22B. That sequence is shown in SEQ ID
NO:20. Sequence alignment with p22B (Devereux et al.
(1987) Sequence Analysis Package of the Genetics
Computer Group, University of Wisconsin Biotechnology
Center) showed a sequence identity of 73.6% after the
insertion of eight gaps. The sections of alignment span
1170 bases of the pCAN11 insert and correspond
approximately to bases 424 through 1027 in SEQ ID NO:1.
Five of the eight pCAN11 sequences which do not align
with p22B appear to be introns; the remaining three gaps
may be introns or the combination of intron with coding
regions which are less homologous with p22B. Assuming
reasonable intron splicing, the resulting open reading
frame encodes 168 amino acids of the putative
thioesterase. Of these residues, 132 are identical to
the soybean seed acyl-ACP thioesterase and fifteen
residues present in the soybean protein are not
accounted for in the gene from Brassica. Clone pCAN11
thus encodes a large portion of the Brassica acyl-ACP
thioesterase.
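The residue counts above can be turned into a percent identity
with a one-line calculation. The sketch below is a hedged
illustration: it assumes identity is taken over the 168 encoded
residues only, and the helper names and the toy aligned strings
are hypothetical. The Genetics Computer Group programs cited in
the text may score gaps differently.

```python
def percent_identity(identical, compared):
    """Percent of compared positions that are identical."""
    if compared <= 0:
        raise ValueError("compared must be positive")
    return 100.0 * identical / compared


def identity_from_alignment(seq_a, seq_b, gap="-"):
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(seq_a, seq_b) if gap not in (a, b)]
    matches = sum(1 for a, b in columns if a == b)
    return percent_identity(matches, len(columns))


# Counts quoted for the pCAN11 open reading frame versus the soybean protein.
AMINO_ACID_IDENTITY = percent_identity(132, 168)

# Toy alignment (not real sequence data): 4 of 5 ungapped columns match.
TOY_IDENTITY = identity_from_alignment("MKT-AL", "MRTQAL")

if __name__ == "__main__":
    print(round(AMINO_ACID_IDENTITY, 1))
```

On this basis 132 identical residues out of 168 correspond to
roughly 79% amino acid identity.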



EXAMPLE 6

p22B AS A PROBE FOR ACYL-ACP THIOESTERASE GENES FROM
CUPHEA LANCEOLATA AND CUPHEA VISCOSISSIMA

Genomic clones of acyl-ACP thioesterases from
Cuphea viscosissima and Cuphea lanceolata were obtained
using a polymerase chain reaction (PCR) strategy using
initiation primers based on segments of the sequence of
p22B. Three segments were chosen from the deduced amino
acid sequence as amino acid sequences encoded by
relatively non-degenerate DNA codons. These segments
were synthesized to include all probable DNA sequences
encoding the amino acid sequence. The sequences
synthesized and their approximate corresponding bases in
SEQ ID NO:1 are shown below. Positions at which all
combinations of multiple bases were synthesized are
denoted as combinations of bases inside parentheses.



TC-58 5'-TA(T/C)AA(G/A)GA(G/A)AA((»)TT(T/C)-3' (SEQ ID NO:14 -
corresponding to bases 343 through 357 of SEQ ID NO:1)

TC-59 5'-AA(A/G)TGGGT(A/T/G/C)ATGATGAA(T/C)CAA-3' (SEQ ID NO:15 -
corresponding to bases 676 through 696 of SEQ ID NO:1)

TC-60 5'-(C/T)TG(A/G)TTCATCAT(A/T/G/C)ACCCA(T/C)TT-3'
(SEQ ID NO:16 - corresponding to the complementary strand of
TC-59)

TC-61 3'-CT(T/C)CT(C/A)TT(T/C)GT(A/G)CT(T/C)GT(G/A)GT-
(T/C)GT(C/T)-5' (SEQ ID NO:17 - corresponding to complementary
strand of bases 1125 through 1101 of SEQ ID NO:1).
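The parenthesis notation used for the degenerate primers can be
expanded mechanically to see how many distinct oligonucleotides
each synthesis contains. The sketch below is illustrative only:
the function names are hypothetical, and TC-60 is used as the
example because its sequence is legible in full as printed.

```python
import re
from itertools import product

# A token is either a single base or a parenthesised set such as (A/T/G/C).
TOKEN = re.compile(r"\(([ACGT](?:/[ACGT])*)\)|([ACGT])")


def parse_degenerate(primer):
    """Split a primer such as 'A(T/C)G' into one tuple of bases per position."""
    positions = []
    index = 0
    while index < len(primer):
        match = TOKEN.match(primer, index)
        if match is None:
            raise ValueError(f"unexpected character at offset {index}")
        group = match.group(1)
        positions.append(tuple(group.split("/")) if group else (match.group(2),))
        index = match.end()
    return positions


def degeneracy(primer):
    """Number of distinct sequences represented by the degenerate primer."""
    count = 1
    for choices in parse_degenerate(primer):
        count *= len(choices)
    return count


def expand(primer):
    """List every concrete sequence in the degenerate mixture."""
    return ["".join(combo) for combo in product(*parse_degenerate(primer))]


TC60 = "(C/T)TG(A/G)TTCATCAT(A/T/G/C)ACCCA(T/C)TT"

if __name__ == "__main__":
    print(len(parse_degenerate(TC60)), "positions;", degeneracy(TC60), "sequences")
```

For TC-60 as written, the four degenerate positions multiply out
to a mixture of 32 different 21-base oligonucleotides.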

Four PCR reactions were run using buffers,
deoxynucleotides, Taq polymerase and reaction conditions
from a GENEAMP kit (Perkin-Elmer/Cetus), with 200 ng of
genomic DNA from either C. lanceolata or C. viscosissima
as template and either TC-58 and TC-60 or TC-59 and
TC-61 as the sense and antisense primers. The degenerate
primers were used at a final concentration of 1 mM. The
temperature cycling reactions were carried out in a
Perkin-Elmer/Cetus Thermocycler with the temperature at
the annealing cycle set to 37°C. The extension and
denaturation steps were 72°C and 92°C respectively and
30 cycles were performed.

No products were formed with the TC-59/TC-61 primer
set. A product of about 0.7 kB in size was formed with
the TC-58/TC-60 primer set using genomic DNA from either
species as template. The 0.7 kB fragment from both
species was purified from several minor products also
present in the initial PCR reaction and used as the
template for re-amplification using the same conditions
as in the initial reaction. Both products amplified and
were again gel purified for blunt-end cloning into
EcoRV-cut and phosphatase-treated Bluescript SK. One
hundred ng of both fragments were used in a 10 µL
ligation reaction at 12°C overnight. One µL of the
ligation mix was used to transform 100 µL of competent
E. coli cells. Transformants were recovered by plating
on plates containing ampicillin (150 µg/mL) to which
was also added 50 µL of 5-bromo-4-chloro-3-indolyl-β-D-
galactopyranoside (X-gal) (20 µg/mL) and 10 µL of 100 mM
IPTG. Six white colonies were recovered from the
C. lanceolata transformation and seven from the
C. viscosissima transformation. Plasmid DNA was
prepared from each of the thirteen cell lines and
digested with restriction endonucleases to excise the
cloned insert from the Bluescript plasmid. One insert
of the expected size was obtained from both species and
double stranded plasmid was prepared from each of
the two cell lines for sequencing.

An 865 base pair insert was sequenced from
C. viscosissima (SEQ ID NO:22) and an 852 base pair
insert was sequenced from C. lanceolata (SEQ ID NO:21).
Sequence alignment (Devereux et al., (1987) Sequence
Analysis Package of the Genetics Computer Group,
University of Wisconsin Biotechnology Center) shows that
the two sequences are 96.6% identical to one another.
Similar alignment of the sequence from C. viscosissima
with that of p22B (SEQ ID NO:1) shows an overall
identity of 79.9% with the insertion of three gaps. The
gaps appear to be introns and the sequence ends are in
agreement with the sequences of p22B which were used to
design the PCR primers. Removal of the introns and
translation of the resulting open reading frame gives
two amino acid sequences which are 93% identical to the
sequence derived from the corresponding base sequence of
p22B. The two clones are thus partial copies of the
genomic C. viscosissima and C. lanceolata acyl-ACP
thioesterases.

EXAMPLE 7

CONSTRUCTION OF VECTORS FOR TRANSFORMATION OF PLANTS FOR
ALTERED EXPRESSION OF ACYL-ACP THIOESTERASE

Sense and antisense expression constructions using the
constitutive 35S promoter.

The starting vectors for the 35S constructions were
p22B carrying the soybean seed acyl-ACP thioesterase
gene and pK35K. pK35K was in turn derived from pKNK
(WO91/09957). pKNK is a pBR322-based vector which
contains a neomycin phosphotransferase II (NptII)
promoter fragment, a nopaline synthase (NOS) promoter
fragment, the coding region of NptII and the
polyadenylation region from the NOS gene. A map of this
plasmid is shown by Lin et al. (Plant Physiol. (1987)
84: 856-861). The 320 bp ClaI-BglII fragment in pKNK
that contains the NptII promoter was obtained as a
HindIII-BglII fragment from the NptII gene of the
transposon Tn5 described by Beck et al. (Gene (1982) 19:
327-336). The HindIII site was converted to a ClaI site
by linker addition. The NptII promoter fragment is
followed by a 296 bp Sau3A-PstI NOS promoter (NOS/P)
fragment corresponding to nucleotides -263 to +33, with
respect to the transcription start site, of the NOS gene
described by Depicker et al. (J. Appl. Genet. (1982) 1:
561-574). The PstI site at the 3' end was created at
the translation initiation codon of the NOS gene. The
NOS/P is followed by a 998 bp HindIII-BamHI sequence






containing the NptII coding region obtained from the
transposon Tn5 (Beck et al., (1982) Gene 19: 327-336)
by the creation of HindIII and BamHI sites at
nucleotides 1540 and 2518, respectively. The NptII
coding region is then followed by a 702 bp BamHI-ClaI
fragment containing the 3' end of the nopaline synthase
gene including nucleotides 848 to 1550 (Depicker et al.,
J. Appl. Genet. (1982) 1: 561-574). The remainder of
pKNK consists of pBR322 sequences from 29 to 4361.

pKNK was converted to pK35K by removing the NptII
and NOS promoters and replacing them with a CaMV 35S
promoter. The EcoRI-HindIII 35S promoter fragment is
the same as that contained in pUC35K (WO91/09957). The
35S promoter fragment was prepared as follows, and as
described in Odell et al. (Nature (1985) 313: 810-813)
except that the 3' end of the fragment includes CaMV
sequences to +21 with respect to the transcription start
site. A 1.15 kb BglII segment of the CaMV genome
containing the region between -941 and +208 relative to
the 35S transcription start site was cloned in the BamHI
site of the plasmid pUC13. This plasmid was linearized
at the SalI site in the polylinker located 3' to the
CaMV fragment and the 3' end of the fragment was
shortened by digestion with nuclease Bal31. Following
the addition of HindIII linkers, the plasmid DNA was
re-circularized. From nucleotide sequence analysis of the
isolated clones, a 3' deletion fragment was selected
with the HindIII linker positioned at +21. To create
pK35K this 35S promoter fragment was isolated as an
EcoRI-HindIII fragment, the EcoRI site coming from the
polylinker of pUC13, and ligated to pKNK that had been
digested with EcoRI and HindIII, the EcoRI site lying 5'
to the ClaI site in pBR322.

pK35K was digested with BamHI and the cut ends were
blunted using the Klenow fragment of DNA polymerase.
Digestion with HindIII then removed the NptII coding
region, leaving pK35K linearized with a half HindIII site
at the 3' end of the 35S promoter sequence and a blunt
end 5' to the NOS 3' region. Digestion of p22B with the
combination of HindIII and EcoRV released a fragment
which begins twelve bases 5' to the start methionine of
the soybean seed acyl-ACP thioesterase precursor protein
and ends in the Bluescript vector just 3' to the 3'
non-coding region of p22B. Gel purification of both the
p22B-derived fragment and the modified pK35K fragment as
described in Example 2 followed by ligation of the
fragments with T4 DNA ligase gave pKTE9 which contains
the coding sequence for soybean seed acyl-ACP
thioesterase linked to the 35S promoter in a manner
expected to produce a functional enzyme in an
appropriate cell.

To produce an expression vector for production of
antisense message from p22B, pK35K was digested with the
combination of BamHI and HindIII to remove the existing
coding sequence for NptII and the ends of the remaining,
linearized plasmid were blunted using the Klenow
fragment of DNA polymerase. Two XmnI sites exist in
p22B (at the 5' end coincident with the EcoRI used for
cloning into pBluescript and spanning bases 1662 through
1672 at the 3' end of SEQ ID NO:1) so that digestion
with XmnI removes the entire sequence of p22B including
the 5' and 3' non-coding regions of the cDNA and leaves
blunt ends. Gel purification of the desired fragments
as described above followed by blunt end ligation and
recovery of transformants gave both the sense and
antisense orientations of p22B 3' to the 35S promoter.
Orientation of the p22B insert in pK35K was determined
by restriction mapping using the combination of
restriction endonucleases EcoRI and BamHI. The combined
digestion releases a 1101 base pair fragment (950 bases
from the pK35K plasmid and 116 bases from the XmnI
insert from p22B) in the case of sense orientation of
p22B with respect to the promoter and a 2365 base pair
fragment in the case of antisense orientation (950 bases
from pK35K and 1415 bases from the XmnI fragment of
p22B). The antisense orientation construction (pKTER)
is suitable for use in antisense constructs because it
contains all of the 5' and 3' noncoding regions.

The soybean somatic embryo transformation described
below requires the use of hygromycin as the selectable
marker for transformation. To introduce this selectable
marker into the vector, a second plasmid pML18 was
constructed by the introduction of a DNA segment
containing the 35S promoter from pK35K 5' to the
hygromycin phosphotransferase gene from E. coli (Gritz
et al. Gene (1983) 25:179) and 3' to the NOS 3' end.
This segment was ligated into the SalI site of the
plasmid pGEM9Z (Promega). To introduce the 35S:acyl-ACP
thioesterase:NOS construction into pML18, pKTE9 was cut
with AatI and ClaI and blunted with the Klenow fragment
of DNA polymerase. AatI cuts pKTE9 just 5' to the 35S
promoter and ClaI just 3' to the NOS 3' end. XbaI
linkers were ligated to the blunt ended fragment, the
fragment was purified by gel electrophoresis and ligated
into the cut XbaI site of pML18. After transformation
and recovery of clones, plasmid DNA was purified from
several clones and the construct was restriction-mapped
to determine the relative orientation of the two
35S:coding region units. A clone was selected which had
the following orientation: In the poly restriction site
of pGEM9Z, and oriented 3' to the f1 origin of
replication; at the XbaI site is the 35S promoter
followed by the coding region of the acyl-ACP
thioesterase gene described above, followed by the NOS
3' end. Beginning at a second XbaI site is the second
35S promoter followed by the hygromycin
phosphotransferase gene and the second NOS 3' end. The
vector was given the name pKR12.

A vector with hygromycin selection and antisense
expression of the acyl-ACP thioesterase message was
obtained by a similar strategy. To obtain compatible
ends on the acyl-ACP thioesterase transcription unit in
pKTER, the plasmid was digested with EcoRI and ClaI
which released the 35S promoter, p22B derived sequence
and NOS 3' end as a unit. The EcoRI and ClaI sites in
the cloning region of pBluescript were cut and the
purified EcoRI-ClaI fragment from pKTER was ligated into
pBluescript. A clone was isolated from transformed
E. coli cells. This clone was cut at the XbaI site
which is in the cloning region of pBluescript to create
one XbaI end. The SalI site at the other end of the
insert in pBluescript was digested, blunted and XbaI
linkers were ligated to it to produce the second end.
The resulting fragment was purified by gel
electrophoresis and ligated into pML18 which had been
digested by XbaI as above. A single clone was isolated
from the transformation. This construction, pKR13, was
determined by restriction mapping to have the same order
of the two transcription units as described for pKR12.

Vectors for transformation of the acyl-ACP
thioesterase gene under control of the 35S promoter into
plants using Agrobacterium tumefaciens were produced by
constructing a binary Ti plasmid vector system (Bevan,
Nucl. Acids Res. (1984) 12:8711-8720). The vector
(pZS199) is based on a vector which contains: (1) the
chimeric gene nopaline synthase/neomycin
phosphotransferase as a selectable marker for transformed
plant cells (Bevan et al., Nature (1984) 304: 184-186),
(2) the left and right borders of the T-DNA of the Ti
plasmid (Bevan et al., Nucl. Acids Res. (1984) 12:8711-
8720), (3) the E. coli lacZ α-complementing segment
(Vieira et al., Gene (1982) 19:259-267) with unique
restriction endonuclease sites for EcoRI, KpnI, BamHI,
HindIII, and SalI, (4) the bacterial replication origin
from the Pseudomonas plasmid pVS1 (Itoh et al., Plasmid
(1984) 11:206-220), and (5) the bacterial neomycin
phosphotransferase gene from Tn5 (Berg et al., Proc.
Natl. Acad. Sci. U.S.A. (1975) 72:3628-3632) as a
selectable marker for transformed A. tumefaciens. The
nopaline synthase promoter in the plant selectable
marker was replaced by the 35S promoter by a standard
restriction endonuclease digestion and ligation
strategy. The 35S promoter is required for efficient
Brassica napus transformation as described below.

For construction of the antisense expression
vector, pZS199 was digested with EcoRI and SalI. pKTER
was also digested with EcoRI and SalI to release the
35S:antisense acyl-ACP thioesterase:NOS transcriptional
unit which was isolated by gel electrophoresis. The
EcoRI/SalI fragment was ligated into the cut pZS199 and
used to transform E. coli competent cells. Isolation of
a clone and purification of the plasmid DNA gave the
binary vector pZKR13.

The sense 35S construction was assembled by
removing the acyl-ACP thioesterase coding region and a
portion of the 3' untranslated region from p22B by
digestion with HindII and SspI. SspI cuts after base
1351 in SEQ ID NO:1. The HindII site was blunted, the
fragment isolated by gel electrophoresis, and ligated
into the HindIII/BamHI and blunted version of pK35K
described above. Clones resulting from the
transformation of E. coli were restriction-mapped by
cutting with BamHI and EcoRI. The sense-oriented insert
gives a unique 1101 base fragment which is indicative of
the sense orientation. The resulting plasmid (pKTE10)
was cut at the EcoRI and SalI sites described in pK35K
above and cloned into pBluescript cut with the same
restriction endonucleases to give pBTE4.

Cloning into the high copy number plasmid pBTE4
allowed isolation of plasmid DNA which was digested with
SalI and EcoRI. The resulting fragment containing the
transcriptional unit 35S:acyl-ACP thioesterase:NOS was
then ligated into pZS199 which had been similarly
digested to give the desired sense expression vector
pKR12.

For cloning the thioesterase sequence into existing
expression vectors containing seed specific promoters,
an NcoI site was engineered at the start methionine of
p22B. For this purpose two PCR primers were
synthesized:

KR40 5'-AAAAATCTAGAAGCTTTCGTGCCATGGCTTGGACC-3' (SEQ ID
NO:18) corresponding approximately to bases 83 through
117 in SEQ ID NO:1. This created an XbaI site
(substitutions at 89 and 91) and an NcoI site
(substitution at 105).

KR41 5'-AGCGTACCGGGATCCGCCTCTA-3' (SEQ ID NO:19)
corresponding approximately to the complementary strand
of bases 274 through 296 in SEQ ID NO:1.

The polymerase chain reaction run with these two
primers and p22B as the template amplified a 213 base
pair fragment which contained the restriction
endonuclease cleavage sites described in KR40 as well as
an existing BamHI site in p22B (bases 282 through 287 in
SEQ ID NO:1).

Most of the 3' untranslated region of p22B was
removed by digestion with SspI and HincII followed by
re-ligation of the blunt ends to give pBTE8. Both the
PCR amplified fragment and pBTE8 were digested with XbaI
and BamHI. The remaining, linearized pBTE8 derived
fragment was purified by gel electrophoresis and the two
fragments were ligated to give the restriction site
modified acyl-ACP thioesterase pPTE1.

The 5' and 3' regulatory sequences from the
phaseolin gene of Phaseolus vulgaris described by Doyle
et al. (J. Biol. Chem. (1986) 261:9228-9238) and
containing the unique restriction endonuclease sites
NcoI, SmaI, KpnI and XbaI between the 5' and 3'
regulatory sequences were placed into the HindII site in
the cloning region of a pUC18 plasmid (BRL) to give the
plasmid pCW108.

The NcoI to KpnI fragment cleaved from pPTE1 and
purified by gel electrophoresis was ligated into pCW108
after digestion with the same two enzymes to give
plasmid pPHTE1. Removal of the entire phaseolin
5':acyl-ACP thioesterase:phaseolin 3' transcriptional
unit by digestion with HindIII, gel purification of the
fragment and ligation into HindIII cut pBluescript gave
pPHTE2. Cleavage of pPHTE2 at the EcoRI and SalI sites
in the cloning region of the original pBluescript
plasmid released the desired transcriptional unit with
the EcoRI and SalI sites required for cloning into the
binary vector pZS199 as described above to give pZPHTE1.

The promoter region for the 2S2 albumin protein
from Arabidopsis thaliana was obtained as 1250 base
pairs 5' to the NcoI site which is coincident with the
start ATG as described by Krebbers et al. (Plant
Physiol. (1988) 87:859-866) along with the 750 base pair
coding region ahead of a 1000 base pair 3' regulatory
sequence from the octopine synthase (OCS) gene of
Agrobacterium (DeGreve et al., J. Mol. Appl. Genet.
(1982) 1:499-511) all contained in a pUC19 cloning
vector (BRL). The 2S albumin coding sequence was
removed from the vector by digestion with NcoI and XbaI
which cleave at the start ATG and just 3' to the 2S
albumin stop codon in the OCS 3' regulatory sequence.
The acyl-ACP thioesterase coding sequence from pTE1 was
removed from the remainder of the plasmid by digestion
with NcoI and XbaI and purified by gel electrophoresis.
Ligation of the two fragments gave pSTE1.

A unique EcoRI site at the 5' end of the 2S2
promoter sequence and a HindIII site 3' to the OCS 3'
sequence were digested to release the 2S2:acyl-ACP
thioesterase:OCS transcriptional unit. The fragment was
purified and ligated into the cut EcoRI and HindIII
sites described in pZS199 above to give the binary
vector pZSTE1.

EXAMPLE 8

TRANSFORMATION OF SOMATIC SOYBEAN EMBRYO CULTURES

Culture of Embryogenic Suspensions

Soybean embryogenic suspension cultures were
maintained in 35 mL liquid media (SB55 or SBP6 described
below) on a rotary shaker, 150 rpm, at 28°C with mixed
fluorescent and incandescent lights on a 16:8 h
day/night schedule. Cultures were subcultured every
four weeks by inoculating approximately 35 mg of tissue
into 35 mL of liquid medium.

Transformation

Soybean embryogenic suspension cultures were
transformed by the method of particle gun bombardment
(see Klein et al. Nature (1987) (London) 327:70). A
Du Pont Biolistic® PDS1000/HE instrument (helium
retrofit) was used for these transformations.

DNA/Particle Preparation

To 50 µL of a 60 mg/mL 1 µm gold particle
suspension was added (in order): 5 µL DNA (1 µg/µL),
20 µL spermidine (0.1 M), and 50 µL CaCl2 (2.5 M). The
particle preparation was agitated for three min, spun in
a microfuge for 10 sec and the supernatant removed. The
DNA-coated particles were then washed once in 400 µL 70%
ethanol and resuspended in 40 µL of anhydrous ethanol.
The DNA/particle suspension was sonicated three times
for 1 sec each. Five µL of the DNA-coated gold
particles were then loaded on each macro carrier disk.
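The quantities in the particle preparation imply a per-disk load
of gold and DNA that can be estimated by simple arithmetic. The
sketch below is illustrative only and rests on assumptions that
are not stated in the text: that all of the gold and all of the
DNA are recovered through the precipitation and wash steps, and
that the suspension is evenly mixed when loaded. The values it
yields are therefore upper bounds, and the names are hypothetical.

```python
# Volumes and concentrations as given in the particle preparation.
GOLD_UL = 50.0          # µL of gold suspension
GOLD_MG_PER_ML = 60.0   # mg gold per mL of suspension
DNA_UL = 5.0            # µL of DNA
DNA_UG_PER_UL = 1.0     # µg DNA per µL
RESUSPEND_UL = 40.0     # µL anhydrous ethanol used for resuspension
LOAD_UL = 5.0           # µL loaded per macro carrier disk


def per_disk_load():
    """Estimate disks per preparation and the maximum gold/DNA on each disk."""
    gold_mg = GOLD_UL * GOLD_MG_PER_ML / 1000.0   # total gold in the tube
    dna_ug = DNA_UL * DNA_UG_PER_UL               # total DNA added
    fraction = LOAD_UL / RESUSPEND_UL             # share of the prep per disk
    return {
        "disks": RESUSPEND_UL / LOAD_UL,
        "gold_mg": gold_mg * fraction,
        "dna_ug": dna_ug * fraction,
    }


LOAD = per_disk_load()

if __name__ == "__main__":
    print(LOAD)
```

Under these assumptions one preparation loads eight disks, each
carrying at most about 0.375 mg of gold and 0.625 µg of DNA.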

Bombardment

Approximately 300-400 mg of a four-week-old
suspension culture was placed in an empty 60x15 mm petri
dish and the residual liquid removed from the tissue
with a pipette. For each transformation experiment,
approximately 5-10 plates of tissue were bombarded.
Membrane rupture pressure was set at 1000 psi and the
chamber was evacuated to a vacuum of 71 cm mercury. The
tissue was placed approximately 8.9 cm away from the
retaining screen and bombarded three times. Following
bombardment, the tissue was placed back into liquid and
cultured as described above.

Eleven days after bombardment, the liquid media was
exchanged with fresh SB55 containing 50 mg/mL
hygromycin. The selective media was refreshed weekly.
Seven weeks after bombardment, green, transformed tissue
was observed growing from untransformed, necrotic
embryogenic clusters. Isolated green tissue was removed
and inoculated into individual flasks to generate new,
clonally propagated, transformed embryogenic suspension
cultures. Thus, each new line was treated as an
independent transformation event. These suspensions
could then be maintained as suspensions of embryos




clustered in an immature developmental stage through 
subculture or regenerated into whole plants by 
maturation and germination of individual somatic 
embryos. 

Maturation and Germination

Transformed embryogenic clusters were removed from
liquid culture and placed on a solid agar media (SB103)
containing no hormones or antibiotics. Embryos were
cultured for eight weeks at 26°C with mixed fluorescent
and incandescent lights on a 16:8 h day/night schedule.
During this period, individual embryos were removed from
the clusters and analyzed at various stages of embryo
development.

Media:

SB55 and SBP6 Stock Solutions (grams per liter):

MS Sulfate 100X Stock
MgSO4 7H2O      37.0
MnSO4 H2O       1.69
ZnSO4 7H2O      0.86
CuSO4 5H2O      0.0025

MS Halides 100X Stock
CaCl2 2H2O      44.0
KI              0.083
CoCl2 6H2O      0.00125
KH2PO4          17.0
H3BO3           0.62
Na2MoO4 2H2O    0.025

MS FeEDTA 100X Stock
Na2EDTA         3.724
FeSO4 7H2O      2.784

B5 Vitamin Stock
10 g m-inositol
100 mg nicotinic acid
100 mg pyridoxine HCl
1 g thiamine

SB55 (per L)
10 mL each MS stocks
1 mL B5 Vitamin stock
0.8 g NH4NO3
3.033 g KNO3
1 mL 2,4-D (10 mg/mL stock)
60 g sucrose
0.667 g asparagine
pH 5.7

SBP6 (per L)
0.5 mL 2,4-D in SB55

SB103 (per L)
MS Salts
6% maltose
750 mg MgCl2
0.2% Gelrite
pH 5.7
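Because SB55 takes 10 mL of each MS stock per liter, the final
concentration of every stock component follows directly from the
stock values in grams per liter. The sketch below is illustrative
only: the function and variable names are hypothetical, and only a
few components whose stock values are legible above are included.

```python
# Stock concentrations in grams per liter, taken from the stock tables.
STOCK_G_PER_L = {
    "MgSO4 7H2O": 37.0,
    "CaCl2 2H2O": 44.0,
    "KH2PO4": 17.0,
    "Na2EDTA": 3.724,
    "FeSO4 7H2O": 2.784,
}

MS_STOCK_ML_PER_L = 10.0   # mL of each MS stock added per liter of SB55


def final_mg_per_l(stock_g_per_l, stock_ml_per_l=MS_STOCK_ML_PER_L):
    """Final concentration (mg/L) after adding stock_ml_per_l of stock to 1 L."""
    stock_mg_per_ml = stock_g_per_l          # g/L is numerically equal to mg/mL
    return stock_mg_per_ml * stock_ml_per_l


def final_2_4_d_mg_per_l(stock_ml, stock_mg_per_ml=10.0):
    """Final 2,4-D concentration (mg/L) from a 10 mg/mL stock."""
    return stock_ml * stock_mg_per_ml


SB55_FINAL = {name: final_mg_per_l(value) for name, value in STOCK_G_PER_L.items()}
SB55_2_4_D = final_2_4_d_mg_per_l(1.0)   # SB55: 1 mL of stock per liter
SBP6_2_4_D = final_2_4_d_mg_per_l(0.5)   # SBP6: 0.5 mL of stock per liter

if __name__ == "__main__":
    for name, value in SB55_FINAL.items():
        print(f"{name}: {value:.2f} mg/L")
```

On this reading SB55 contains 10 mg/L 2,4-D and SBP6 contains
5 mg/L, with the stock components diluted one hundredfold.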



EXAMPLE 9

AGROBACTERIUM MEDIATED TRANSFORMATION

Tobacco transformation

The binary vectors pKR12, pZSTE1, and pPHTE were
transferred by a freeze/thaw method (Holsters et al.,
Mol Gen Genet (1978) 163:181-187) to the Agrobacterium
strain LBA4404/pAL4404 (Hoekema et al., Nature (1983)
303:179-180). The Agrobacterium transformants were used
to inoculate tobacco leaf disks (Horsch et al., Science
(1985) 227:1229-1231). Transgenic plants were
regenerated in selective media containing kanamycin.

Brassica napus transformation

Seeds of cultivar B. napus Westar were surface
sterilized with a solution of 10% Clorox®, 0.1% SDS and
placed on germination media consisting of 30 mM CaCl2,
1.5% agar for 5 to 7 days.

Three mL cultures of Agrobacterium tumefaciens
(strain LBA 4404) containing the desired binary vector
constructions were grown for 18 to 20 h in Min A media
at 28°C. To begin the transformation, plates of
co-cultivation media (BC-1 with 100 µM acetosyringone) were
poured and allowed to air-dry in a laminar flow hood.
Seedling hypocotyls were cut into 1 cm segments and
placed into 22.5 mL of bacterial dilution medium (MS
liquid media with 100 µM acetosyringone). To the
solution containing the hypocotyl segments was added
2.5 mL of the overnight culture of Agrobacterium. After
30 min the hypocotyl segments were removed and placed,
10 per plate, on the co-cultivation media plates. The
plates were then incubated at 25°C for three days in dim
light.

After three days the segments were transferred to
selective media plates (BC-1 media with 200 mg/L
carbenicillin and 50 mg/mL kanamycin). Callus growth
occurred at the cut ends of the hypocotyls over the next
20 days, and after 20 days calli greater than 5 mm in
diameter were transferred to selective regeneration
media (BS-48 containing 200 mg/L carbenicillin). At the
same time, the remaining hypocotyl segments were
transferred to fresh selective media, and additional calli
developing over the next 15 days were also transferred
to the selective regeneration media. All calli produced
were thus transferred to selective regeneration media by
72 days after the co-cultivation with Agrobacterium.

Individual calli on selective regeneration media
were maintained in continuous light at 25°C, and placed
on fresh media at two-week intervals. If no shoot
primordia appeared after six weeks on the regeneration
media, the calli were chopped into 5 mm pieces, re-plated
on BC-1 media containing 200 mg/L carbenicillin
for three days, then transferred back to BS-48 media
with 200 mg/L carbenicillin. Shoots appeared three to
six weeks after calli were transferred to BS-48 media.

Shoots formed on BS-48 were allowed to elongate
somewhat before excision and plating on MSVA-1A media.
Shoots were transferred to fresh MSVA-1A media for a
second, three-week cycle before transplanting directly
into potting mix.

Media (amounts/L)

BC-1
MS minimal organic salts medium (MS salts + 100 mg/L
i-inositol and 0.4 mg/mL thiamine)
30 g sucrose
18 g mannitol
3 mg Kinetin
3 g DNA grade agarose
adjust to pH 5.8

BS-48
MS minimal organic medium (as above)
B5 vitamins (1 mL of 1000X stock, described above)
250 mg xylose
10 g glucose
0.6 g MES
4 g DNA grade agarose
adjust to pH 5.7 and add from sterile solutions after
autoclaving: 2 mg Zeatin and 0.1 mg indole acetic acid

MSVA-1A
MS minimal organic salts medium
10 g sucrose
B5 vitamins (1 mL of 1000X stock, described above)
6 g DNA grade agarose
adjust to pH 5.8

EXAMPLE 10
ANALYSIS OF TRANSGENIC PLANTS

Analysis of Somatic Soybean Embryos
While in the globular embryo state in liquid
culture as described in Example 8, somatic soybean
embryos contain very low amounts of triacylglycerol or
storage proteins typical of maturing, zygotic soybean
embryos. At this developmental stage, the ratio of
total triacylglyceride to total polar lipid
(phospholipids and glycolipid) was about 1:4, as is
typical of zygotic soybean embryos at the developmental
stage from which the somatic embryo culture was
initiated. At the globular stage as well, the mRNAs for
the prominent seed proteins α'-subunit of β-conglycinin,
Kunitz Trypsin Inhibitor III and Soybean Seed Lectin
were essentially absent. Upon transfer to hormone-free,
solid media to allow differentiation to the maturing
somatic embryo state as described in Example 8,
triacylglycerol became the most abundant lipid class.
Similarly, mRNAs for α'-subunit of β-conglycinin,
Kunitz Trypsin Inhibitor III and Soybean Seed Lectin
became very abundant messages in the total mRNA
population. In these respects, the somatic soybean
embryo system behaves very similarly to maturing zygotic
soybean embryos in vivo, and is therefore a good and
rapid model system for analyzing the phenotypic effects
of modifying the expression of genes in the fatty acid
biosynthesis pathway such as acyl-ACP thioesterase and
for predicting the alterations expected in zygotic
embryos. Similar zygotic embryo culture systems have
been documented and used in another oilseed crop,
rapeseed (Taylor et al., Planta (1990) 181:16-26).

Assay for in vitro thioesterase activity from globular
stage somatic soybean cultures of Example 8

Uniform clumps from eighteen of the twenty-one
transformed lines obtained in Example 8, along with 3
non-transformed controls, were placed in tared, 1.5 mL
microfuge tubes and re-weighed to obtain the tissue
fresh weight. Two times the tissue weight in an
extraction buffer consisting of 0.1 M Tricine (pH 8.2),
0.5 mM EDTA and 1 mM DTT was added and the tissue piece
was homogenized with a small pestle. The homogenate was
centrifuged to clear and 2 µL of the supernatant was
added to an assay mixture consisting of 35 µL of the
above Tricine buffer also containing 1 mg BSA/mL and
1 µM [14C]-oleoyl-ACP (58 mCi/mmol). The reaction was
stopped after 2 min by the addition of 100 µL of 10%
acetic acid in 2-propanol. Hydrolyzed 14C-oleate was
extracted from the mixture by two 1 mL extractions with
water-saturated hexane and taken for scintillation
counting. Extracted protein was determined by the
Bradford assay (Biorad) using 2 µL of the extract. The
results of these assays are shown in Table 4.
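
The conversion from scintillation counts to the specific activity units used in Table 4 can be sketched as follows. This is an illustrative calculation only: the function name and the example counts are hypothetical, and it assumes that counts are already corrected to disintegrations per minute (dpm) and that extraction of the released oleate is complete. The only values taken from the text are the substrate specific radioactivity (58 mCi/mmol) and the 2 min reaction time.

```python
DPM_PER_NCI = 2220.0  # 1 nCi corresponds to 2220 disintegrations per minute


def thioesterase_specific_activity(dpm_released, substrate_mci_per_mmol,
                                   reaction_min, protein_mg):
    """Return nmol oleate released per mg protein per min.

    dpm_released: radioactivity recovered in the hexane extract (assumed dpm).
    substrate_mci_per_mmol: specific radioactivity of the [14C]-oleoyl-ACP.
    reaction_min: reaction time in minutes.
    protein_mg: protein present in the assayed aliquot, in mg.
    """
    # mCi/mmol is numerically identical to nCi/nmol
    dpm_per_nmol = substrate_mci_per_mmol * DPM_PER_NCI
    nmol_released = dpm_released / dpm_per_nmol
    return nmol_released / (reaction_min * protein_mg)


# Hypothetical example: 1000 dpm released in 2 min by 4 µg (0.004 mg) protein
example_activity = thioesterase_specific_activity(1000.0, 58.0, 2.0, 0.004)
print(round(example_activity, 2))
```

With these made-up inputs the result falls near 1 nmol·mg protein⁻¹·min⁻¹, the same order as the control values reported in Table 4.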



TABLE 4

CULTURE LINE        THIOESTERASE SPECIFIC ACTIVITY
IDENTIFICATION      (nmol·mg protein⁻¹·min⁻¹)

Control             0.92
Control             1.21
Control             1.08
194-5/4             0.69
194-6/5             0.75
194-6/1             0.39
194-3,5,6-1         0.30
194-5/2             0.34
194-6/4             0.88
194-5/1             0.76
194-3,5,6-2         0.78
194-3,4,6-3         0.39
194-1/2             0.73
194-4/1             0.41
194-2/2             1.12
194-6/2             1.09
194-6/3             0.92
194-1/4             0.15
194-1/1             0.38
194-5/3             0.88
194-5/5             0.20

These results were unexpected because the acyl-ACP
thioesterase gene was introduced in such a manner as to
encode the sense message from the gene, and was therefore
expected to produce additional acyl-ACP thioesterase protein
and corresponding additional enzymatic activity. When a
gene is introduced into tissue in which that same
gene or one very highly homologous to it is expressed,
cosuppressive inhibition of both messages is a
possibility. Another factor which was considered is
that the control tissue in this experiment was not
transformed and grown on the selective media as
described in Example 8. It is possible that the
selective media suppresses thioesterase activity and
that the controls utilized were improper. To test for
this possibility, tissue clumps from the selected,
transformed lines were removed from media containing the
selective agent (hygromycin) and allowed to re-grow in
liquid culture. Assays were again performed as
described above. The result was identical: no
transformed lines exhibited thioesterase-specific
activities higher than control cultures. When the
thioesterase specific activities for transformed lines
grown on selective media were plotted against the
specific activities for the same lines grown without
hygromycin, the correlation coefficient was 0.85. It is
logical to conclude that the suppression of acyl-ACP
thioesterase activity is a function of transformation.
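
The comparison of each line grown with and without hygromycin reduces to a Pearson correlation between two paired lists of specific activities. The sketch below shows that calculation in plain Python; the function name and the paired values are hypothetical and are not the measurements behind the reported coefficient of 0.85.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squares about the means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical paired specific activities (nmol·mg protein⁻¹·min⁻¹):
# the same lines assayed from selective and from hygromycin-free culture
with_hygromycin = [0.2, 0.4, 0.7, 0.9, 1.1]
without_hygromycin = [0.3, 0.35, 0.8, 0.85, 1.2]
print(round(pearson_r(with_hygromycin, without_hygromycin), 2))
```

A coefficient close to 1 indicates that lines with low activity on selective media also have low activity once the selective agent is removed, which is the pattern the text relies on.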

Northern analysis of selected lines of transformed
soybean somatic embryo cultures

Nine of the twenty-one somatic embryo lines, chosen
as representative of lines with greatly decreased
acyl-ACP thioesterase-specific activity, of lines with only
moderately decreased activity, and of lines which do not
appear to be different from untransformed controls, were
grown for further study. Total RNA was obtained from
transformed soybean somatic embryo cultures by the
Phenol/SDS Method (Current Protocols in Molecular
Biology, Ed. F. M. Ausubel et al., (1991) John Wiley and
Sons, pg. 4.3.1-4.3.3).

Poly A+ mRNA was isolated by oligo dT affinity
chromatography as described by Aviv et al. (Proc. Nat.
Acad. Sci. U.S.A. (1972) 69:1408-1412). Two µg of poly A+
mRNA from each transformed soybean culture
line was separated in a denaturing formaldehyde gel for an
RNA blot analysis as described by Lehrach et al. (Biochemistry
(1977) 16:4743-4749). Standards containing known amounts
of pure acyl-ACP thioesterase mRNA were included in the
gel for quantitation of acyl-ACP thioesterase mRNA in the
transgenic lines. The standard was synthesized in vitro
using the method of Krieg et al. (Nucl. Acids Res.
(1984) 12:7057-7070) with p22B as the template DNA. The
gel-separated mRNA was transferred to Nytran filter and
hybridized with 32P-labelled soybean thioesterase RNA
probes as described by Berger et al. (Methods Enzymol.
(1987) 152:577-582), again using p22B as the template.
Hybridization was at 68°C in 50% formamide, 0.5 M NaCl,
10x Denhardt's, 0.2% SDS, 250 µg/mL yeast RNA. The
filter was washed at 68°C once in 2xSSC for 30 min and four
times in 0.2xSSC for 30 min each. The filter was
exposed to X-ray film overnight at -80°C with a Du Pont
Cronex® intensifying screen.

The construction of pKR12 (see Example 7) deletes a
portion of the 5' untranslated region of the soybean
seed acyl-ACP thioesterase message. As a result, the
expected message size for the expressed acyl-ACP
thioesterase transgene is about 200 base pairs smaller
than the message from expressed endogenous genes. In
all nine lines a message of about 1.6 kb in size was
present in the somatic soybean embryos. In all but line
194-6/3 a second message of about 1.4 kb in size was
also present. After probing with the acyl-ACP
thioesterase probe, the blots were stripped of labelling
by continued washing as above and re-probed with [32P]-
RNA prepared as described above but using a Bluescript
plasmid containing the cDNA for soybean seed oleosin in
the insert. The oleosin message is highly expressed in
the somatic soybean embryos and was used to normalize
the amount of mRNA loaded from each line. The two lines
expressing greatly reduced acyl-ACP thioesterase
activity (194-5/5 and 194-1/4) also had greatly reduced
levels of both the transgene acyl-ACP thioesterase
message and the endogenous acyl-ACP thioesterase
message. Lines 194-1/4 and 194-5/4 had slightly reduced
levels of both messages although it appeared that the
endogenous message was decreased in relation to the
transgene message. The level of both messages was
somewhat lower in line 194-5/4 than in line 194-1/4.
Line 194-6/3 had only the endogenous message but lines
194-6/4 and 194-6/5 had high levels of both the
transgene and endogenous gene messages, while all three
of these lines had acyl-ACP thioesterase activities at
or near the wildtype level. The single message signal
in 194-6/3 is explained by the lack of an introduced
acyl-ACP thioesterase gene in this line (see Southern
analysis below), but the lack of effect of the expressed
acyl-ACP thioesterase message in lines 194-6/4 and
194-6/5 is not simply explained. The reduced message levels
in the remaining lines correlate exactly with reduced
acyl-ACP thioesterase activity and are diagnostic of
cosuppression as seen when highly homologous messages of
slightly differing size are produced (van der Krol
et al., The Plant Cell (1990) 2:291-299).
The sequence of SEQ ID NO:1 or any nucleic acid
fragment substantially homologous therewith is therefore
shown to be effective in reducing acyl-ACP thioesterase
activity by cosuppression when re-introduced into soybean
and expressed in an appropriate expression vector.

Southern analysis of genomic DNA from transformed
somatic soybean embryos

Genomic DNA was isolated from maturing somatic
embryos from the 7 surviving lines described below and
digested with XbaI as described in Example 3. Southern
analysis was also done as in Example 3 using either the
acyl-ACP thioesterase coding sequence as the probe
template or the neomycin phosphotransferase coding
sequence as the probe template. Using the coding
sequence of acyl-ACP thioesterase as the probe revealed
that all lines except 194-6/3 contained introduced
copies of the sequence in addition to the endogenous
copies. All lines except 194-5/5 had at least one copy
which was not rearranged from the introduced pKR12
construction. Line 194-5/5 had multiple inserts, all of
which had undergone some rearrangement. Probing with
the neomycin phosphotransferase coding sequence showed
that all transformed lines had at least one copy of the
selectable marker. Occurrence of copies ranged from one
in the case of line 194-6/3 to eight in the case of line
194-5/5.

Analysis of fatty acid profiles and triacylglycerol
synthesis in transformed soybean somatic embryos

Seven of the transformed lines from Example 8 were
successfully grown on solid, hormone-free maturation
media. These lines were used for growth rate analysis,
analysis of the rate of triacylglycerol synthesis, and
analysis of the fatty acid profile of the
triacylglycerol.
Following placement on the maturation media and
subsequent differentiation of the globular embryo
culture into the maturing embryos, four replicate
samples of five embryos each per line were taken at
intervals. The length of time to differentiation varied
with culture line, but embryos produced by each line
were of very similar fresh and dry weights at the point
of differentiation. This point, at which differentiated
embryos could be easily removed from the remaining
globular culture, was designated as "time 0" in the
course of triacylglycerol synthesis and dry weight
accumulation.

Embryos from each line and time point were weighed
for fresh weight, lyophilized and re-weighed for dry
weight and lipid extraction. An internal standard of
tri-heptadecanoyl glycerol was prepared by reacting the
acid chloride of heptadecanoic acid with glycerol in
dimethylformamide (DMF) with triethyl amine. The
triacylglyceride was purified by passage through silica,
crystallized from diethyl ether and used to make a
0.5 mg·mL⁻¹ standard solution in 2-propanol. Addition of
100 µL of the standard solution to the extraction
solvent for each sample gave an internal standard of 50
µg which was co-purified, derivatized and
chromatographed with the extracted lipid. With the
internal standard solution added, the embryos were
ground in 0.5 mL of diethyl ether and centrifuged. The
ether layer was removed and the extraction was repeated.
The combined extracts were passed through a prepared
silica column (Sep Pak silica cartridge, Millipore) and
the neutral lipid fraction was eluted with 2 mL of
diethyl ether. The column eluate was taken to dryness
under an N2 stream and neutral lipids in the residue
were transesterified to methanol in 0.5 mL of 1% sodium
methoxide in methanol. One mL of a saturated NaCl
solution was added and the fatty acid methyl esters were
extracted into diethyl ether. The ether solutions were
taken to dryness under an N2 stream and the extracted
methyl esters were re-dissolved in between 50 and 200 µL
of hexane (depending on the embryo age) for analysis by
GLC. GLC separations were done isothermally at 185°C on
a fused silica capillary column (stationary phase
SP-2330, 30 m in length, Supelco, Bellefonte, PA). Data
were analyzed by integration relative to the assigned
weight of the internal standard peak to determine both
the absolute weight of total fatty acids in
triacylglycerol and the relative contribution of each of
the five most prominent fatty acids in soybean
triacylglycerol.
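
The internal-standard integration described above can be illustrated with a short calculation. This is a sketch under stated assumptions, not the applicants' data reduction: the function name and the peak areas are hypothetical, and the detector response per unit weight is assumed to be the same for every methyl ester. The 50 µg internal standard weight is the value given in the text.

```python
def quantify_fatty_acids(peak_areas, internal_std_area, internal_std_ug=50.0):
    """Convert GLC peak areas to weights using the 17:0 internal standard.

    peak_areas: dict mapping fatty acid name to integrated peak area
                (internal standard peak excluded).
    internal_std_area: integrated area of the heptadecanoate peak.
    internal_std_ug: weight assigned to the internal standard peak (µg).
    Returns (weights in µg, total µg, percent of total by fatty acid).
    """
    # Assumes equal detector response per unit weight for all methyl esters
    weights = {name: area / internal_std_area * internal_std_ug
               for name, area in peak_areas.items()}
    total = sum(weights.values())
    percent = {name: 100.0 * w / total for name, w in weights.items()}
    return weights, total, percent


# Hypothetical integrator output for one sample
areas = {"16:0": 300.0, "18:0": 70.0, "18:1": 200.0,
         "18:2": 1130.0, "18:3": 300.0}
weights_ug, total_ug, percent_of_total = quantify_fatty_acids(areas, 1000.0)
for name in areas:
    print(name, round(weights_ug[name], 1), round(percent_of_total[name], 1))
```

Dividing the total weight by the dry weight of the sample would then give the triacylglycerol content as a percentage of dry weight.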

The specific activity of acyl-ACP thioesterase was
also analyzed in the maturing embryos at mid-maturation
by the method described above. The relative
contributions of individual fatty acids to the total
fatty acid profile, total amount of triacylglyceride
synthesized, and the specific activity of the acyl-ACP
thioesterase for the seven transformed lines and one
untransformed control are given in Table 5.

TABLE 5

                                                     Total           Thioesterase
             % of Total Fatty Acids                  Triacylglycerol Sp. Activity
Cell Line    16:0    18:0    18:1    18:2    18:3    (% dry wt.)     (nmol·mg⁻¹·min⁻¹)

Control      14.9    3.5     9.9     56.6    15.0    5.7             1.07
194-6/3      16.5    3.4     9.8     50.8    18.9    5.8             1.06
194-6/4      13.8    3.0     9.1     56.6    16.5    7.6             0.98
194-6/5      11.1    2.9     10.7    57.0    15.6    ND              0.80
194-2/4      13.2    2.7     10.6    59.1    13.6    6.2             0.70
194-5/4      9.8     3.0     11.0    57.8    16.8    6.9             0.64
194-1/4      17.4    3.7     7.6     51.5    19.6    3.6             0.24
194-5/5      18.2    4.6     4.7     46.2    25.8    2.4             0.22

The fatty acid profile values in Table 5 are the
means of four to six determinations. The thioesterase
specific activities are the means of three assays, two
done at the globular tissue stage and one at the
developing embryo stage.

The results show that the nucleotide sequence of
SEQ ID NO:1 is effective in altering seed storage lipid
biosynthesis. Moderate reduction of the acyl-ACP
thioesterase activity does reduce the level of saturated
fatty acid in triacylglycerol (the 16:0 value for line
194-5/4 is significantly lower than the control values).
Reduction of the acyl-ACP thioesterase activity in
the range of five-fold or greater leads to additional effects:
the total accumulation of triacylglycerol was
significantly decreased and it is likely that the rate
of triacylglycerol synthesis was also decreased.

Analysis of tobacco transformed with constitutive and
seed specific constructions

Tobacco plants transformed with pKR12, pKR13,
p2STE1, and pPHTE1 (see Example 9) were analyzed for
acyl-ACP thioesterase activity. Those plants transformed
with the constitutive constructs pKR12 and pKR13 were
analyzed at the callus level, at the seedling stage just
after transfer to pots, and in the developing seeds.
Seven developing plants were obtained from
transformations with pKR12 and six from transformations
with pKR13. Of the seven pKR12 transformants, two
showed acyl-ACP thioesterase specific activity that was
higher than control plants in very young seedlings. One
of those plants (KR12-4B) maintained measurably higher
levels of thioesterase activity in developing seeds.
Tobacco seeds undergo a marked developmental change in
seed acyl-ACP thioesterase activity. Since it is
difficult to determine seed developmental age with
accuracy, determining increased thioesterase activity
relative to controls is also imprecise. Nevertheless,
it appeared that plant KR12-4B retained about a two-fold
increased acyl-ACP thioesterase specific activity in the
seeds. Twenty-two immature seeds from the segregating
population of seeds on KR12-4B were individually assayed
for thioesterase activity on a per seed basis. Three
individuals of the twenty-two had acyl-ACP thioesterase
activity in the range of 1.2 to 1.7 nmol/10 min/seed.
Five seeds had activity in the range from 3.3 to 3.9
nmol/10 min/seed, while the remaining fourteen fell in
the range between 2 and 3 nmol/10 min/seed. This ratio
is reasonably near the 1:2:1 ratio that would be
predicted for the segregating population from a single
effective transgene insert if each gene dose of the
transgene gives acyl-ACP thioesterase activity
approximately equal to that from the endogenous gene in
this plant.
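
How near the observed 3:14:5 split is to a 1:2:1 expectation can be checked with a chi-square goodness-of-fit calculation. The sketch below is an illustration added for the reader rather than an analysis reported by the applicants; the function names are hypothetical, and the p-value uses the closed form that holds only for two degrees of freedom.

```python
import math


def chi_square_gof(observed, expected_ratio):
    """Chi-square statistic for observed counts against an expected ratio."""
    total = sum(observed)
    ratio_sum = sum(expected_ratio)
    expected = [total * r / ratio_sum for r in expected_ratio]
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return chi2, expected


def p_value_df2(chi2):
    """Upper-tail probability of a chi-square variate with exactly 2 d.f."""
    return math.exp(-chi2 / 2.0)


# Seed counts from the text: low, intermediate and high activity classes
observed_counts = [3, 14, 5]
chi2_stat, expected_counts = chi_square_gof(observed_counts, [1, 2, 1])
p_val = p_value_df2(chi2_stat)
print(expected_counts, round(chi2_stat, 2), round(p_val, 2))
```

For twenty-two seeds the expected counts are 5.5, 11 and 5.5, and the statistic works out to 2.0, well under the 5.99 critical value for two degrees of freedom at the 5% level, so the counts do not contradict single-insert segregation.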

Eleven tobacco plants transformed with p2STE1 and
six tobacco plants transformed with pPHTE1 have been
assayed for acyl-ACP thioesterase activity in developing
seeds. Of the p2STE1 transformed plants, five did not
appear to be different from wildtype in activity, three
were clearly higher than wildtype and three others were
not at developmental stages which allowed comparison.
The three transformants which had higher activity were
judged to have 1.9, 2.0 and 2.6 times the acyl-ACP
thioesterase-specific activity of untransformed controls
at an equivalent developmental stage. Of the six pPHTE1
transformants assayed, two could not be compared
reliably due to their immature developmental stage, two
were approximately equal to wild-type, and two had
higher activity. These two lines measured 2.5 and 2.9
fold higher than equivalent control seeds.

Applicants have shown that either constitutive or
seed specific expression of the soybean seed acyl-ACP
thioesterase gene in a plant may give increased acyl-ACP
thioesterase activity provided that endogenously
expressed acyl-ACP thioesterases are not excessively
homologous to the introduced gene.

Analysis of transformed Brassica napus

Brassica napus transformed with pKR13 as described
in Example 9 was analyzed for acyl-ACP thioesterase
specific activity at the stage of transformed callus
after re-induction on hormonal media as described in
Example 9. Calli from twenty-eight individual
transformants along with four control calli were assayed
by grinding the callus with a pestle in a 1.5 mL
microfuge tube after addition of a buffer concentrate
consisting of 10 µL of 0.1 M Tricine, pH 8 and 10 mM
DTT. The homogenate was centrifuged to clear and 5 µL
of the supernatant was used in the acyl-ACP thioesterase
assay as described above. The assay value for each
transformant was compared to the control average and
then placed in classes of 10% intervals to produce a
frequency distribution. See Table 6.

TABLE 6

CLASS
(% of control)    FREQUENCY

110-90            7
89-80             3
79-70             4
69-60             7
59-50             2
49-40             1
39-30             1
29-20             3

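
The binning behind Table 6 can be sketched as follows. The helper names and the assay values are hypothetical, and two details are assumptions not stated in the text: percentages are rounded to whole numbers before classing, and every value at or above 90% of the control average is placed in the top "110-90" class.

```python
from collections import Counter


def percent_of_control(value, control_values):
    """Express one assay value as a percentage of the control average."""
    control_mean = sum(control_values) / len(control_values)
    return 100.0 * value / control_mean


def activity_class(percent):
    """Return the Table 6 class label for a percent-of-control value."""
    pct = round(percent)  # assumption: classes are assigned on whole percents
    if pct >= 90:
        return "110-90"   # assumption: top class absorbs everything >= 90%
    lower = (pct // 10) * 10
    return f"{lower + 9}-{lower}"


def frequency_table(values, control_values):
    """Count transformants falling in each 10% class."""
    return Counter(activity_class(percent_of_control(v, control_values))
                   for v in values)


# Hypothetical assay values for four control calli and four transformants
controls = [1.0, 1.0, 1.0, 1.0]
transformants = [0.95, 0.85, 0.25, 0.22]
print(frequency_table(transformants, controls))
```

Applied to the twenty-eight measured calli, the same counting would reproduce the frequency column of Table 6.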
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Hitz, William D.

Yadav, Narendra S.

(ii) TITLE OF INVENTION: Nucleotide Sequences of Soybean Acyl-ACP
Thioesterase Genes

(iii) NUMBER OF SEQUENCES: 22 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: E. I. du Pont de Nemours and Company 

(B) STREET: 1007 Market Street 

(C) CITY: Wilmington

(D) STATE: Delaware 

(E) COUNTRY: U.S.A 

(F) ZIP: 19898 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/631,264 

(B) FILING DATE: 20-DEC-1990 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Morrissey, Bruce W.

(B) REGISTRATION NUMBER: 30,663

(C) REFERENCE/DOCKET NUMBER: CR-8926-A

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (302) 992-4927 

(B) TELEFAX: (302) 892-7949 

(C) TELEX: 835420 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1602 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Glycine max

(B) STRAIN: Cultivar Wye

(D) DEVELOPMENTAL STAGE: Early seed fill 

(E) HAPLOTYPE: Diploid 

(F) TISSUE TYPE: Cotyledon 
(I) ORGANELLE: Nucleus 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA to mRNA 

(B) CLONE: 22B 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide

(B) LOCATION: 271..1206

(C) IDENTIFICATION METHOD: Catalytically active when 
expressed in E. coli 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 106..1209

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CCTTCTTTCT CATTCTCATA CGCACCCAGT CACCCAGCTT TCCCTTTTTC CTATTTTTTT 60 

TCTCTTTTTT TATTAAAAAA ATAAAAATGT TGAAGCTTTC GTGCA ATG GCT TGG 114 

Met Ala Trp 
-55 

ACC GGG CTC ACT CCC TGG CCC AAT GCG CTT CCG GGC CGG CCC GCC TGC 162 
Thr Gly Leu Thr Pro Trp Pro Asn Ala Leu Pro Gly Arg Pro Ala Cys 
-50 -45 -40 

GCC GTC CCT CGC CGG AGG AGG AGC GGC GTC TCC GGA TTC CGG TTG CCG 210
Ala Val Pro Arg Arg Arg Arg Ser Gly Val Ser Gly Phe Arg Leu Pro
-35 -30 -25

GAA GGC AGG TCG ATC CGG GTG TCC GCG GCG GTG TCG GCA AAG GAC GGC 258 
Glu Gly Arg Ser Ile Arg Val Ser Ala Ala Val Ser Ala Lys Asp Gly
-20 -15 -10 -5

GCG GTG GCG ACC CGG GTA GAG GCG GAT CCC GGT ACG CTG GCG GAC CGG 306 
Ala Val Ala Thr Arg Val Glu Ala Asp Pro Gly Thr Leu Ala Asp Arg 
1 5 10 

CTG AGG GTG GGG AGC TTG ACG GAG GAT GGG TTG TCT TAT AAG GAG AAG 354 
Leu Arg Val Gly Ser Leu Thr Glu Asp Gly Leu Ser Tyr Lys Glu Lys 
15 20 25 



TTC ATT GTG AGG AGC TAC GAA GTT GGG ATC AAT AAG ACT GCC ACT GTT 402 
Phe Ile Val Arg Ser Tyr Glu Val Gly Ile Asn Lys Thr Ala Thr Val
30 35 40 

GAA ACC ATT GCC AAT CTC TTG CAG GAG GTT GGA TGT AAT CAT GCT CAG 450 
Glu Thr Ile Ala Asn Leu Leu Gln Glu Val Gly Cys Asn His Ala Gln
45 50 55 60 

AGT GTT GGA TAT TCT ACT GAT GGT TTT GCA ACC ACC CCT ACG ATG AGA 498 
Ser Val Gly Tyr Ser Thr Asp Gly Phe Ala Thr Thr Pro Thr Met Arg 
65 70 75
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SaS=S5 = SS5SS2SSS. 

Gly Glu Gly Arg Val Gly Thr Arg Arg Asp Phe lie Leu *ys asp ay 

1X0 3- 15 
GCA ACI GAT GAA GTT ATT GGA AGG GCA ACA AGC AAA TGG GTA ATG ATG 
Ala Thr Asp Glu Val He Gly Arg Ala Thr Ser Lys Trp Val Met Met 
125 xo ° 

145 " 

cue TAT TTG GTT TTC TGT CCT CGA GAG CCC AGG TTA GCT ATT CCA GAG 
£u Su SS Phe Cys Pro Arg Glu Pro Arg Leu Ala lie Pro Glu 
160 165 * ,u 

175 1B0 > B5 

CAS TAT TCC AGA CTT GGA CTX GTG CCA AGA A6A GCG GAT CTG GAC ATG 
Sn Tyr. Sr Su Gly Leu Val Pro Arg Arg Ala Asp Leu Asp Met 



190 



AAT CAG CAT GTT AAC AAT GTC ACC TAT ATT GGA TGG GTG CTT GAG AGC 
Asn Gin His Val Asa Asn Val Thr Tyr lie Gly Trp val Leu exu &er 



546 



594 



642 



690 



738 



786 



834 



882 



930 



978 



1026 



225 23U 

SAT TAC AGA CGA GAG TGC GGA CAA CAT GAC ATA GTC GAT TCC CTC ACT 
Sp Sr iS 2g Su Cys Gly Gla HI. Asp lie Val Asp Ser Leu Thr 

rpjj »ip« cag GGT GGT GCC GAG GCA GTT CCA GAA CTG AAA 1074 
£ Si Su £ S Sn S 5 Ala Glu Ala Val Pro Glu Leu Lys 

255 260 " 265 

GGT ACA AAT GGA TCT GCC ACG GCA AGG GAA GAC AAA CAT GAA CAC CAG 1122 
S? S i£"S Ser Ala Thr Ala Arg Glu Asp Lys His Glu His Gla 

270 213 
CAG TIT CTG CAT CTA CTT AGG TTG TCT ACT GAA GGA CTT GAG ATA AAC 1170 
£n Se Su Ss Su Leu Arg Leu Ser Thr Glu Gly Leu Glu lie Asn 
285 290 Z 

CGG GGA CGA ACA GAA TGG AGA AAG AAA GCT CCA AGA TGAGAACCAT 1216 
Arg Gly Arg Thr Glu Trp Arg Lys Lys Ala Pro Arg 
305 3 10 
TATGTGTGCT TCCACCCGAA TCCATGATTC TGTTTTTGTC TTGTGTTGTT TCATGTTACC 1276

AGGGTTGTCT TATCAATTTT CCCTTGATAT TTTGCTTAGA GTTTGTGCGC TTAATAGGGA 1336 

TTGAAGAGTT AAAATATTGC TTCTGTTTTC TTGTCATGCT GATCAAAAAT TTAAGTTGTC 1396 

CAAATCCCGT AGTTAGGCTA TATAGGTTGA CATCAATCTC TGATCCATTA GTATCAGATT 1456

CCATGAATGT CATTGTACCT TAAGGGAGCA TAGAAATCCA GGAAGTTGGT ATGGATCTGC 1516 

CATCTACTGC ATGACTTGAA CAATGTGTGT TAAAATAATC ATTTTGAAAT AATTCAATTA 1576 

GCTAATTATT AATGTTCTTA AAAAAA 1602

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Trp Thr Gly Leu Thr Pro Trp Pro Asn Ala Leu Pro Gly Arg 
-55 -50 -45 -40 

Pro Ala Cys Ala Val Pro Arg Arg Arg Arg Ser Gly Val Ser Gly Phe 
-35 -30 -25 

Arg Leu Pro Glu Gly Arg Ser Ile Arg Val Ser Ala Ala Val Ser Ala
-20 -15 -10

Lys Asp Gly Ala Val Ala Thr Arg Val Glu Ala Asp Pro Gly Thr Leu
-5 1 5

Ala Asp Arg Leu Arg Val Gly Ser Leu Thr Glu Asp Gly Leu Ser Tyr
10 15 20 25

Lys Glu Lys Phe Ile Val Arg Ser Tyr Glu Val Gly Ile Asn Lys Thr
30 35 40

Ala Thr Val Glu Thr Ile Ala Asn Leu Leu Gln Glu Val Gly Cys Asn
45 50 55

His Ala Gln Ser Val Gly Tyr Ser Thr Asp Gly Phe Ala Thr Thr Pro
60 65 70

Thr Met Arg Lys Leu Arg Leu Ile Trp Val Thr Ala Arg Met His Ile
75 80 85

Glu Ile Tyr Lys Tyr Pro Ala Trp Ser Asp Ile Val Glu Ile Glu Thr
90 95 100 105

Trp Cys Gln Gly Glu Gly Arg Val Gly Thr Arg Arg Asp Phe Ile Leu
110 115 120

Lys Asp Tyr Ala Thr Asp Glu Val Ile Gly Arg Ala Thr Ser Lys Trp
125 130 135
Val Met Met Asn Gln Asp Thr Arg Arg Leu Gln Lys Val Ser Asp Asp
140 145 150

Val Lys Glu Glu Tyr Leu Val Phe Cys Pro Arg Glu Pro Arg Leu Ala
155 160 165

Ile Pro Glu Ala Asp Ser Asn Ser Leu Lys Lys Ile Pro Lys Leu Glu
170 175 180 185

Asp Pro Ala Gln Tyr Ser Arg Leu Gly Leu Val Pro Arg Arg Ala Asp
190 195 200

Leu Asp Met Asn Gln His Val Asn Asn Val Thr Tyr Ile Gly Trp Val
205 210 215

Leu Glu Ser Met Pro Gln Glu Ile Ile Asp Ser His Glu Leu Gln Ser
220 225 230

Ile Thr Leu Asp Tyr Arg Arg Glu Cys Gly Gln His Asp Ile Val Asp
235 240 245

Ser Leu Thr Ser Val Glu Ala Ile Gln Gly Gly Ala Glu Ala Val Pro
250 255 260 265

Glu Leu Lys Gly Thr Asn Gly Ser Ala Thr Ala Arg Glu Asp Lys His
270 275 280

Glu His Gln Gln Phe Leu His Leu Leu Arg Leu Ser Thr Glu Gly Leu
285 290 295

Glu Ile Asn Arg Gly Arg Thr Glu Trp Arg Lys Lys Ala Pro Arg
300 305 310

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Glycine max 

(B) STRAIN: Cultivar Wye

(D) DEVELOPMENTAL STAGE: Early seed fill 

(E) HAPLOTYPE: Diploid 

(F) TISSUE TYPE: Cotyledon
(I) ORGANELLE: Nucleus 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA to mRNA 

(B) CLONE: 4C 
(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 282..1217



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 117..1220

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

CCTTCAAAAC CACTTGTTTC TTCAGTTCCA CTCTGCTTCT TCCCCTTTCT CTTCTCATAC 60 

TCACCCAGCT TTCCTTTTTA TTAAAAAACA AAAAAAAATG TTGAAGCTTT CGTGCA 116

ATG GCT TGG ACC GGG CTC ATA TGC TGG CCC AAT GCG TTT GCG GGC CGG 164 
Met Ala Trp Thr Gly Leu Ile Cys Trp Pro Asn Ala Phe Ala Gly Arg
-55 -50 -45 -40

GGC CGC TGC GCT CGT CCC AGC CGG AGG ATA AGC GGC ATC TCC GGA TTC 212 
Gly Arg Cys Ala Arg Pro Ser Arg Arg Ile Ser Gly Ile Ser Gly Phe
-35 -30 -25

TGG TCC CCG GAA GGA GGG CGG ATC CGG GTG TCG GCG GTG GTG TCG GCG 260
Trp Ser Pro Glu Gly Gly Arg Ile Arg Val Ser Ala Val Val Ser Ala
-20 -15 -10

AAG GAT GGC GCG GTG GCG ACC CGG GTG GAG GCG GAG TCC GGG ACG CTG 308 
Lys Asp Gly Ala Val Ala Thr Arg Val Glu Ala Glu Ser Gly Thr Leu
-5 1 5 

GCG GAC CGG CTG AGG GTG GGG AGC TTG ACG GAG GAT GGG TTG TCT TAC 356 
Ala Asp Arg Leu Arg Val Gly Ser Leu Thr Glu Asp Gly Leu Ser Tyr 
10 15 20 25

AAG GAG AAG TTC ATT GTG AGG AGC TAC GAA GTT GGG ATC AAT AAG ACT 404 
Lys Glu Lys Phe Ile Val Arg Ser Tyr Glu Val Gly Ile Asn Lys Thr
30 35 40

GCC ACT GTT GAA ACC ATT GCT AAT CTC TTG CAG GAG GTT GGA TGT AAT 452 
Ala Thr Val Glu Thr Ile Ala Asn Leu Leu Gln Glu Val Gly Cys Asn
45 50 55 

CAT GCT CAG AGT GTT GGA TAT TCT ACT GAT GGT TTT GCA ACC ACC CCT 500
His Ala Gln Ser Val Gly Tyr Ser Thr Asp Gly Phe Ala Thr Thr Pro
60 65 70

ACG ATG AGA AAA TTG CGT CTC ATA TGG GTT ACT GCT CGC ATG CAC ATT 548 
Thr Met Arg Lys Leu Arg Leu Ile Trp Val Thr Ala Arg Met His Ile
75 80 85 



eu iTC TAC AAA TAC CCT GCT TGG AGT GAC GTT GTT GAG ATA GAG ACA 596 
Glu Ile Tyr Lys Tyr Pro Ala Trp Ser Asp Val Val Glu Ile Glu Thr
90 95 100 105 

TGG TGC CAA GGT GAA GGA AGG GTT GGG ACA AGG CGT GAT TTT ATA CTG 644 
Trp Cys Gln Gly Glu Gly Arg Val Gly Thr Arg Arg Asp Phe Ile Leu
110 115 120

AAA GAC TAT GCA ACT GAT GCA GTC ATT GGA AGG GCA ACA AGC AAA TCG 692
Lys Asp Tyr Ala Ser Asp Ala Val Ile Gly Arg Ala Thr Ser Lys Trp
125 130 135 

GTA ATG ATG AAT CAG GAC ACC AGA CGA CTC CAG AAA GTT TCT GAT GAT 740 
Val Met Met Asn Gln Asp Thr Arg Arg Leu Gln Lys Val Ser Asp Asp
140 145 150 

GTT AAA GAA GAG TAT TTG GTT TTC TGT CCT CGA GAG CCC AGG TTA GCA 788 
Val Lys Glu Glu Tyr Leu Val Phe Cys Pro Arg Glu Pro Arg Leu Ala 
155 160 165 

ATT CCA GAG GCA GAT AGC AAT AAC TTG AAG AAA ATA CCG AAA TTG GAA 836 
Ile Pro Glu Ala Asp Ser Asn Asn Leu Lys Lys Ile Pro Lys Leu Glu
170 175 180 185

GAC CCT GCC CAG TAT TCC AGA CTT GGA CTT GTG CCA AGA AGA GCG GAT 884 
Asp Pro Ala Gln Tyr Ser Arg Leu Gly Leu Val Pro Arg Arg Ala Asp
190 195 200 

CTG GAC ATG AAT CAG CAT GTT AAC AAT GTC ACC TAT ATT GGA TGG GTG 932 
Leu Asp Met Asn Gln His Val Asn Asn Val Thr Tyr Ile Gly Trp Val
205 210 215

CTT GAG AGC ATG CCT CAA GAA ATC ATT GAT AGT CAT GAG TTG CAG AGT 980 
Leu Glu Ser Met Pro Gln Glu Ile Ile Asp Ser His Glu Leu Gln Ser
220 225 230 

ATT ACC TTG GAT TAC AGA CGA GAG TGC GGA CAG CAT GAC ATA GTT GAT 1028 
Ile Thr Leu Asp Tyr Arg Arg Glu Cys Gly Gln His Asp Ile Val Asp
235 240 245 

TCC CTC ACT AGT GTG GAA GAA ATC CAG GGT GGT GCC GAG GCA GTT TCA 1076 
Ser Leu Thr Ser Val Glu Glu Ile Gln Gly Gly Ala Glu Ala Val Ser
250 255 260 265 

GAA CTG AAA AGT ACA AAT GGA TCT GCC ATG GCA AGG GAA GAC AAA CAT 1124 
Glu Leu Lys Ser Thr Asn Gly Ser Ala Met Ala Arg Glu Asp Lys His 
270 275 280 

GAA CAC CAG CAG TTT CTG CAT CTA CTT AGG TTG TCT ACT GAA GGA CTT 1172 
Glu His Gln Gln Phe Leu His Leu Leu Arg Leu Ser Thr Glu Gly Leu
285 290 295 

GAG ATA AAC CGG GGA CGA ACG GAA TGG AGA AAG AAA GCT CCA AGA TGAGAACCAT 1227
Glu Ile Asn Arg Gly Arg Thr Glu Trp Arg Lys Lys Ala Pro Arg
300 305 310

TACGTGTGCT TCCACCCAAA TCCATGATTC TGTTTTTGTC TTTCTTGTGT TGTTTCACGT 1287 

TACCAGGGTT ATGAACTTAT CAATTTTCCC TTTATATTTT GCTTAGAGTT TGTGGACCCT 1347 

TAATAGGGGA TTGGAGGAGT TAAAATTTTG TCGCTGTTTT CTTGTCATGC TCACAAATTT 1407 

AAATTGTTGG AATTCATCAT CAAGCTTATC GATACCGTCG ACCTCGAGGG GGGGCCCGGT 1467 



ACCCAATTC 1476
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ala Trp Thr Gly Leu Ile Cys Trp Pro Asn Ala Phe Ala Gly Arg
-55 -50 -45 -40

Gly Arg Cys Ala Arg Pro Ser Arg Arg Ile Ser Gly Ile Ser Gly Phe
-35 -30 -25

Trp Ser Pro Glu Gly Gly Arg Ile Arg Val Ser Ala Val Val Ser Ala
-20 -15 -10

Lys Asp Gly Ala Val Ala Thr Arg Val Glu Ala Glu Ser Gly Thr Leu
-5 1 5

Ala Asp Arg Leu Arg Val Gly Ser Leu Thr Glu Asp Gly Leu Ser Tyr
10 15 20 25

Lys Glu Lys Phe Ile Val Arg Ser Tyr Glu Val Gly Ile Asn Lys Thr
30 35 40

Ala Thr Val Glu Thr Ile Ala Asn Leu Leu Gln Glu Val Gly Cys Asn
45 50 55

His Ala Gln Ser Val Gly Tyr Ser Thr Asp Gly Phe Ala Thr Thr Pro
60 65 70

Thr Met Arg Lys Leu Arg Leu Ile Trp Val Thr Ala Arg Met His Ile
75 80 85

Glu Ile Tyr Lys Tyr Pro Ala Trp Ser Asp Val Val Glu Ile Glu Thr
90 95 100 105

Trp Cys Gln Gly Glu Gly Arg Val Gly Thr Arg Arg Asp Phe Ile Leu
110 115 120

Lys Asp Tyr Ala Ser Asp Ala Val Ile Gly Arg Ala Thr Ser Lys Trp
125 130 135

Val Met Met Asn Gln Asp Thr Arg Arg Leu Gln Lys Val Ser Asp Asp
140 145 150

Val Lys Glu Glu Tyr Leu Val Phe Cys Pro Arg Glu Pro Arg Leu Ala
155 160 165

Ile Pro Glu Ala Asp Ser Asn Asn Leu Lys Lys Ile Pro Lys Leu Glu
170 175 180 185

Asp Pro Ala Gln Tyr Ser Arg Leu Gly Leu Val Pro Arg Arg Ala Asp
190 195 200

Leu Asp Met Asn Gln His Val Asn Asn Val Thr Tyr Ile Gly Trp Val
205 210 215

Leu Glu Ser Met Pro Gln Glu Ile Ile Asp Ser His Glu Leu Gln Ser
220 225 230

Ile Thr Leu Asp Tyr Arg Arg Glu Cys Gly Gln His Asp Ile Val Asp
235 240 245

Ser Leu Thr Ser Val Glu Glu Ile Gln Gly Gly Ala Glu Ala Val Ser
250 255 260 265

Glu Leu Lys Ser Thr Asn Gly Ser Ala Met Ala Arg Glu Asp Lys His
270 275 280

Glu His Gln Gln Phe Leu His Leu Leu Arg Leu Ser Thr Glu Gly Leu
285 290 295

Glu Ile Asn Arg Gly Arg Thr Glu Trp Arg Lys Lys Ala Pro Arg

300 305 310 
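
The feature coordinates listed for SEQ ID NO:3 can be cross-checked against the 367-residue length stated for SEQ ID NO:4. The following Python sketch is illustrative only and is not part of the original listing. It assumes that the coordinates are 1-based and inclusive and that the CDS range 117..1220 includes the terminal stop codon. The names `span_length` and `summarize` are hypothetical.

```python
# Illustrative check of the SEQ ID NO:3 feature coordinates (1-based, inclusive).
# Assumption: the CDS range includes the terminal stop codon.
CDS = (117, 1220)
MAT_PEPTIDE = (282, 1217)


def span_length(start, end):
    """Number of nucleotides in an inclusive 1-based range."""
    return end - start + 1


def summarize(cds, mat_peptide):
    """Derive residue counts from the CDS and mat_peptide ranges."""
    cds_codons, cds_rem = divmod(span_length(*cds), 3)
    mature_codons, mat_rem = divmod(span_length(*mat_peptide), 3)
    transit_codons, transit_rem = divmod(mat_peptide[0] - cds[0], 3)
    if cds_rem or mat_rem or transit_rem:
        raise ValueError("feature ranges are not in a common reading frame")
    return {
        "precursor_residues": cds_codons - 1,  # drop the stop codon
        "mature_residues": mature_codons,
        "transit_residues": transit_codons,
    }


if __name__ == "__main__":
    print(summarize(CDS, MAT_PEPTIDE))
```

Under these assumptions the sketch gives 367 precursor residues, of which 55 precede the mature N-terminus and 312 form the mature enzyme, in agreement with the residue numbering of -55 to 312 used in the listing.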



(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Arg Val Glu Ala Pro Gly Gly Thr Leu Ala Asp Arg Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ile Glu Ile Tyr Lys Tyr Pro Ala Trp Leu Asp Ile Val Glu Ile
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Val Glu Ala Pro Gly Gly Thr Leu Ala Asp
1 5 10

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GTKGARGCNC CWGGWGGNAC NYTKGCAKA 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 15

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

/mod_base= i

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 9

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

/mod_base= i

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 18

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

/mod_base= i

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 21

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

/mod_base= i
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
GTKGARGCNC CTGGNGGNAC NYTKGCAKA
(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ile Glu Ile Tyr Lys Tyr Pro Ala Trp Leu Asp Ile Glu Ile
1 5 10 

(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATNGARATNT AYAARTAYCC NKCNTGGYTN GAYATNGARA TN 42 
(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 3

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

/mod_base= i

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 9

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

/mod_base= i

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 21

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

/mod_base= i

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 24

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

/mod_base= i

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 30

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

/mod_base= i

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 36

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

/mod_base= i



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ATNGARATNT ATAARTATCC NGCNTGGTTN GATATNGARA T 41 
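
SEQ ID NO:11 is written with IUPAC ambiguity codes, while SEQ ID NO:12 places inosine at the positions named in its FEATURE entries. As a rough illustration of what this means for the size of the oligonucleotide pool, the Python sketch below counts the distinct sequences implied by each listing. It is not part of the original listing. The helper names `degeneracy` and `positions` are hypothetical, and counting an inosine position as a single species is a simplifying assumption.

```python
from math import prod

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "K": "GT", "M": "AC", "S": "CG", "W": "AT",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Sequences as printed in the listing, with the spacing removed.
SEQ_ID_11 = "ATNGARATNTAYAARTAYCCNKCNTGGYTNGAYATNGARATN"
SEQ_ID_12 = "ATNGARATNTATAARTATCCNGCNTGGTTNGATATNGARAT"


def degeneracy(oligo, single=""):
    """Count distinct sequences in a pool; symbols in `single` count once."""
    return prod(1 if base in single else len(IUPAC[base]) for base in oligo)


def positions(oligo, symbol):
    """1-based positions at which `symbol` occurs."""
    return [i for i, base in enumerate(oligo, start=1) if base == symbol]


if __name__ == "__main__":
    print("SEQ ID NO:11 pool size:", degeneracy(SEQ_ID_11))
    print("SEQ ID NO:12 N positions:", positions(SEQ_ID_12, "N"))
    # Assumption: each N in SEQ ID NO:12 is inosine and counts as one species.
    print("SEQ ID NO:12 pool size:", degeneracy(SEQ_ID_12, single="N"))
```

The N positions found in SEQ ID NO:12 coincide with the six modified_base locations listed for it (3, 9, 21, 24, 30 and 36).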
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTGTGGAAGC GATACAGGGT GGTGCCGAGG C 31 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
TAYAARGARA AKTTY 
(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AARTGGGTNA TGATGAAYCA A 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
YTGRTTCATC ATNACCCAYT T 
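
SEQ ID NO:15 and SEQ ID NO:16 appear to describe opposite strands of the same region. The Python sketch below, which is illustrative and not part of the original listing, takes the reverse complement of SEQ ID NO:15 using IUPAC codes and tests whether every position is covered by the corresponding symbol of SEQ ID NO:16. The function names `reverse_complement` and `covers` are hypothetical.

```python
# Standard IUPAC nucleotide ambiguity codes and their complements.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "K": "GT", "M": "AC", "S": "CG", "W": "AT",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}
COMPLEMENT = {
    "A": "T", "C": "G", "G": "C", "T": "A",
    "R": "Y", "Y": "R", "K": "M", "M": "K", "S": "S", "W": "W",
    "B": "V", "V": "B", "D": "H", "H": "D", "N": "N",
}

# Sequences as printed in the listing.
SEQ_ID_15 = "AARTGGGTNATGATGAAYCAA"
SEQ_ID_16 = "YTGRTTCATCATNACCCAYTT"


def reverse_complement(oligo):
    """Reverse complement of an oligo written with IUPAC codes."""
    return "".join(COMPLEMENT[base] for base in reversed(oligo))


def covers(general, specific):
    """True if each symbol of `general` includes the matching symbol of `specific`."""
    return len(general) == len(specific) and all(
        set(IUPAC[s]) <= set(IUPAC[g]) for g, s in zip(general, specific)
    )


if __name__ == "__main__":
    rc = reverse_complement(SEQ_ID_15)
    print(rc)
    print(covers(SEQ_ID_16, rc))
```

As printed, the two listings differ only at the first base of SEQ ID NO:16, where Y (C or T) includes the T obtained from the reverse complement of SEQ ID NO:15.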
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
YTGYTGRTGY TCRTGYTTMT CXTC 
(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
AAAAATCTAG AAGCTTTCGT GCCATGGCTT GGACC 35



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:



(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGCGTACCGG GATCCGCCTC TA 22 
(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATGACCATGA TTACGCCAAG CTCGAAATTA ACCCTCACTA AAGGGAACAA AAGCTGGAGC 60

TCCACCGCGG TGGCGGCCGC TCTAGAACTA GTGGATCCCC CGGGCTGCAG GAATTCGGCA 120 

CGAGAAGAAC TTTGTTGTTC GTTTGATGTA GGTTAGGAGG TGGGATGTAA TCAGTTTCAG 180

AGCGTTGTAT TTTTGACTGA TGGGTTTGCG ACAACACCTA CCATGAGGAA ACTGAATCTC 240 

ATTTGGGTCA CTTCGAGAAT GCACATTGAG ATCTACAGAT ATCCAGCTTG GTATTGTTTT 300 

TTTTTTTTCT TTTTGGCTGC GTATGTTTTG ATGACAACAA ATGAGTTGAA TTCTTAAAAA 360 

TTTTGGTTAC AGGGGTGATG TGGTTGTCAG AGTGAAGAAG GATAGCGACA AGGCGTGACT 420 

GGATTCTTAA GGACATTGCT AACCGGCGAA TTCACTGGCC GCAGTACTAG GTTTCCTTCT 480 

CATCATTGTT TGCTTTCTCC ATTGGTTTGT GCAATGGAAT AAAATTTTCT TATGTTAAAG 540 

ATATAAGTTT CTGTCACTTG GGTTTATGGG ACTGTCCTGA TTAGTTGTAC CTATGTGTTA 600

CCGTTTCAGC AAGTAGGTGA TGATGAACCA AGACACAAGA CGGCTACAGA AAGTTTCTGA 660

TGATGTTCGG GACGAGCACT TGATGTTTTG TCCTAAAGAA CCCAGGTAAA AGAACTTTGT 720

GCCAATGCAA TGTTTGCTGG TCAATCATAT CGTTATATTC ATGAATTGCC AACTATTCTG 780 
TTTATTGTAT ATCTTTGTAG ATTAGCATAT CCTGAGGAGG AAAATACCAG AAGCTTGAAG
AATATCCCCA AACTCGAAGA TCTGGCCAAG TACTCAATCA TTGGACTTAA GGTATAAAAT 
AGAACAATAA GATTCTTTGT AAGAATCAAC ATTCCTAAAG GACTTTATAA TCATGTTTCT 
TTGCAGCCAA GAGCGAGCTG ATCTCGGCAT GAACCATCAT GTCAATARTG TCACATATAT 
TGGATGGCTT CTTGAGGTTA GTGTCATCAT CAGCTTCAGT AATAATCATA TGAGCATACC 
TCAAGAGTTA TAGACACGCA CGAACTTCAG GTCATAACTT TGGATTACAG ACGAGAATTT 
AGCAAGACGA TGTGGTGGAT TCATTGACCA CCTCAAAGAA TGGCTCTGCA ACATCAGGCA 
CACAAAGCCA CAACAATACC CAGTTCTTAC ATCTCCTAAG GTTGTAGGTT GAAAGAACTA 
TGAAGTGGTG AGCTGCAGAT CTTTGCATGT GCAGAGGGTT GTAGGTGGGG GCCTTAGCAG 
GGAGGTGTAC GTTGTGTCAT TGAATAACTC GAGGGGGGGC CCGGTACCCA ATTCGCCC 
(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cuphea lanceolata

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
CCCCCGGGCT GCAGGAATTC GATTACAAGG AGAAATTCAT TGTAAGATGC TACGAGGTCG 
GTATTAACAA GACAGCCACC GTCGAGACCA TGGCAAATCT TTTGCAGGTC TCTTTCTTGC 
ATGCATGCAT CGTCAGGTTT CTGGGCATTG GTGATTTGCT TGTATTAATT TACATGTCAA 
ATTTAATATT TCCTTGTCTC CGACATGCAA CACCATTTTT TTTTCTTTAA ATGTTCACTT 
TGGATACAGG AAGTAGGTTG TAACCATGCT CAGAGTATTG GATTCTCAAC CGATGGTTTT 
GCGACGACCA CTACCATGAG AAAATTGAAT CTGATATGGG TTACTCGTCG AATGCACATA 
GAAATTTACA AGTACCCAGC ATGGTTAGTT AGTTCTTTCC ACTCTCTTTC TTCATCTCCC 
CAGCCACCCC ACTGCTAACT TTTTGATTGA CAATTGTTGA TACGTACTCT AGGGGTGATG 
TGGTTGAAAT TGAGACTTGG TGCCAAAGTG AAGGAAGAAT TGGAACAAGA AGGGATTGGA 
TTCTCAAGGA CTATGCTAAT GGTGATGTTA TTGGAAGAGC CACAAGGTAG ACAGACTGCT 
CTCTCATATA TACAGCAGTG AGAGAACAAA AGAATAATAT TGGAACAATA TCAAATCGAA 
TCTAAACAAT TGGAAGACAT TATTTTGAGG AAAGGGAAGA TTGAAACTGA TGTTCTTAGT 
AATCTATACG TGCACGGCGC CATGATTATC CATTTCATGA GAATTGTTCC AATCATTTAT
ATTAATCTGT TTTCAGCAAG TGGGTCATGA TGAATCAAAT CAAGCTTATC GATACCGTCG 840

ACCTCGAGGG GG 852

(2) INFORMATION FOR SEQ ID NO:22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 865 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Cuphea viscosissima

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:

CCCCCTCGAG GTCGACGGTA TCGATAAGCT TGATTATAAG GAGAAGTTTA TTGTCAGATG 60

CTACGAGGTC GGTATTAACA AGACAGCCAC CGTCGAGACC ATGGCAAATC TTTTGCAGGT 120

CAGGTTCTCT CTGTTTCCAT ATCGTTGCAT GCATGCATCG GTTTCTGGGC ATTGGTTATT 180

TGCTTGTATT AATTTACATG TCAAAATTTA ATTTAATATT TCCTTGTCTC CGACATGCAA 240

CACCATTTTT TTTTTTAAAT GTTCACTTTC AATGCAGGAA GTAGGTTGTA ACCATGCTCA 300

GAGTCTTGGA TTCTCAACCG ATGGTTTTGC GACGACCACT ACCATGAGGA AATTGAATCT 360

GATATGGGTT ACTGCTCGAA TGCACATAGA AATTTACAAG TACCCAGCAT GGTTAGTTAG 420

TTCTTCCACT CTCTTTTCTT CATCTCCCCA GCCACCCCAC TGCACTTTTT GATTGACAAT 480

TGTTGGATAC GTCTCTAGGG GTGATGTGGT TGAAATTGAG ACTTGGTGCC AAAGTGAAGG 540

AAGAATCGGA ACAAGAAGGG ATTGGATTCT CAAGGACTAT GCTAATGGTG AAGTTATTGG 600

AAGAGCCACA AGGTAGACAG ACTGCTCTCA TATATACATC AGTGAGATAA CAAAGGGAAT 660

AATATTGGAA CAATATCAAA TCGAATCTAA ACAATTGGAA GACATTATTT TGAGCAAGTG 720

AAGATTGAAA CTGATGTTCT TAGTAATCTA TACGTGCACG GCGCCATGAT TATCCATTTC 780

ATGAGAATTG TTCCAATCAT TTATATTAAT CTGTTTTCAG CAAATGGGTG ATGATGAACC 840

AAATCGAATT CCTGCAGCCC GGGGG 865



What is claimed is: 

1. An isolated nucleic acid fragment comprising 
nucleotide sequence encoding a plant acyl-ACP 
thioesterase. 

2. An isolated nucleic acid fragment of Claim 1 
wherein said fragment is isolated from a plant selected 
from the group consisting of soybean, oil producing 
Brassica species, Cuphea viscosissima and Cuphea
lanceolata.

3. An isolated nucleic acid fragment comprising 
nucleotide sequence encoding the soybean seed acyl-ACP 
thioesterase cDNA corresponding to the nucleotides 1 to 
1602 of SEQ ID NO:1, or any nucleic acid fragment
substantially homologous therewith. 

4. An isolated nucleic acid fragment comprising
nucleotide sequence encoding the soybean seed acyl-ACP 
thioesterase cDNA corresponding to the nucleotides 1 to 
1476 of SEQ ID NO: 3, or any nucleic acid fragment 
substantially homologous therewith. 

5. An isolated nucleic acid fragment of Claim 3 
wherein said nucleotide sequence encodes the soybean 
seed acyl-ACP thioesterase precursor corresponding to 
nucleotides 106 to 1206 of SEQ ID NO:1, or any nucleic
acid fragment substantially homologous therewith. 

6. An isolated nucleic acid fragment of Claim 4 
wherein said nucleotide sequence encodes the soybean 
seed acyl-ACP thioesterase precursor corresponding to
nucleotides 117 to 1217 of SEQ ID NO:3, or any nucleic
acid fragment substantially homologous therewith. 

7. An isolated nucleic acid fragment of Claim 5
wherein the said nucleotide sequence encodes the mature 
soybean seed acyl-ACP thioesterase enzyme corresponding 
to nucleotides 271 to 1206 of SEQ ID NO:1, or any
nucleic acid fragment substantially homologous 
therewith. 

8. An isolated nucleic acid fragment of Claim 6 
wherein the said nucleotide sequence encodes the mature 
soybean seed acyl-ACP thioesterase enzyme corresponding 
to nucleotides 282 to 1217 of SEQ ID NO:3, or any 
nucleic acid fragment substantially homologous 
therewith. 

9. A chimeric gene causing altered levels of 
acyl-ACP thioesterase activity in a transformed plant 
cell, the gene comprising a nucleic acid fragment of 
Claim 1 operably linked to suitable regulatory 
sequences. 

10. A chimeric gene causing altered levels of 
mature seed acyl-ACP thioesterase enzyme in a 
transformed microorganism, the gene comprising a nucleic 
acid fragment of Claim 7 operably linked to suitable 
regulatory sequences. 

11. A plant transformed with the chimeric gene of 
Claim 9. 

12. Oil obtained from the plants containing the 
chimeric genes of Claim 9. 



13. A method of producing soybean seed oil 
containing altered levels of palmitic and stearic acids 
comprising: 

(a) transforming a plant cell of an oil-producing 
species with a chimeric gene of Claim 9, 

(b) growing fertile soybean plants from the 
transformed plant cells of step (a),

(c) screening progeny seeds from the fertile 
soybean plants of step (b) for the desired levels of 
palmitic and stearic acids, and 

(d) processing the progeny seed of step (c) to 
obtain oil containing altered levels of palmitic and 
stearic acids. 

14. A method of Claim 13 wherein the plant cell of
an oil-producing species is selected from the group
consisting of soybean, oil seed Brassica species,
sunflower, cotton, cocoa, peanut, safflower, and corn.

15. A method of producing mature soybean seed
acyl-ACP thioesterase enzyme in microorganisms
comprising:

(a) transforming a microorganism with a chimeric
gene of Claim 10, and

(b) growing the transformed microorganism of step
(a) to produce mature soybean seed acyl-ACP thioesterase
enzyme.

16. A method of RFLP breeding to obtain altered 
levels of palmitic and stearic acids in soybean seed oil 
comprising: 

(a) making a cross between two soybean varieties 
differing in the trait, 
(b) making a Southern blot of restriction enzyme
digested genomic DNA isolated from several progeny 
plants resulting from the cross of step (a); and 

(c) hybridizing the Southern blot with the
radiolabeled nucleic acid fragment of Claim 3 or 4.